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We consider a simple supersymmetric grand unified model which naturally solves the strong 
CP and /X problems via a Peccei-Quinn symmetry and leads to the standard realization of hybrid 
inflation. We show that the Peccei-Quinn field of this model can act as a curvaton. In contrast 
to the standard curvaton hypothesis, both the inflaton and the curvaton contribute to the total 
curvature perturbation. The model predicts the existence of an isocurvature perturbation too which 
has mixed correlation with the adiabatic one. The cold dark matter of the universe is mostly 
constituted by axions, which are produced at the QCD phase transition, plus a small amount 
of lightest sparticles. The predictions of the model are confronted with the first-year Wilkinson 
microwave anisotropy probe and other cosmic microwave background radiation data. We analyze 
in detail two representative choices of parameters for our model and derive bounds on the curvaton 
contribution to the adiabatic perturbation. We find that, for the choice which provides the best 
fitting of the data, the curvaton contribution to the amplitude of the adiabatic perturbation must be 
smaller than about 67% and the amplitude of the partial curvature perturbation from the curvaton 
smaller than 43.2 x 10"^ (both at 95% confidence level). The best-fit power spectra are dominated 
by the adiabatic part of the inflaton contribution. We use Bayesian model comparison to show 
that this choice of parameters is disfavored with respect to the pure inflaton scale-invariant case 
with odds of about 50 to 1. For the second choice of parameters examined, the adiabatic mode 
is dominated by the curvaton, but this choice is strongly disfavored relative to the pure inflaton 
scale-invariant case (with odds of about 10^ to 1). We conclude that in the present framework the 
perturbations must be dominated by the adiabatic component from the inflaton. 

PACS numbers: 12.10.-g,12.60. Jv,98.80.Cq 



I. INTRODUCTION 

Inflation, which was originally proposed [J] as a so- 
lution to the outstanding problems of standard big bang 
cosmology and the problem of unwanted relics, is in good 
agreement with the recent measurements 0i on the 
angular power spectrum of the cosmic microwave back- 
ground radiation (CMBR). Moreover, inflation is now es- 
tablished as the most likely origin of the primordial den- 
sity perturbation from which structure formation in the 
universeproceeded ,^J. According to the usual assump- 
tion 0, P , this perturbation is generated solely by the 
slowly rolling inflaton field of the usual one-field infia- 
tionary models and, thus, is expected to be purely adi- 
abatic. However, although adiabatic perturbations are 
perfectly consistent with the present data, the presence 
of a significant isocurvature density perturbation cannot 
be excluded In one-field inflation, the perturba- 

tions are almost Gaussian, in agreement with the current 
upper bounds on non-Gaussianity from the CMBR data. 
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which though cannot exclude the presence of appreciable 
non-Gaussianity (for a review, see e.g. Ref. Q)- 

Lately, the alternative possibility "o] that the adia- 
batic density perturbations originate from the inflation- 
ary perturbations of some light "curvaton" field different 
from the infiaton has attracted much attention. The cur- 
vaton density perturbations can lead ^ JLQJ , after curva- 
ton decay, to isocurvature perturbations in the densities 
of the various components of the cosmic fiuid. In the sim- 
plest case, the residual isocurvature perturbations are ei- 
ther fully correlated or fully anti-correlated with the adi- 
abatic density perturbation. In general models, however, 
the correlation can be mixed (see e.g. Ref. jlj). In the 
curvaton scenario, significant non-Gaussianity may also 
appear. The main reason for advocating the curvaton 
hypothesis is that it makes the task of constructing 
viable models of inflation much easier, since it liberates 
us from the very restrictive requirement that the inflaton 
is responsible for the curvature perturbations. 

In a recent paper a simple extension 13j of the 
minimal supersymmetric standard model (MSSM) which 
naturally and simultaneously solves the strong CP and ^ 
problems via a Peccei-Quinn (PQ) Q and a continuous 
R symmetry was considered within the general frame- 
work of the standard supersymmetric (SUSY) version 
[Hill of hybrid inflation [l^. It was shown that, in this 
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model, the PQ field, which breaks spontaneously the PQ 
symmetry, can successfully act as a curvaton generating 
the total curvature perturbation in the universe in ac- 
cordance with the cosmic background explorer (COBE) 
measurements 18]. The (intermediate) PQ scale is gen- 
erated by invoking supergravity (SUGRA) and the PQ 
field corresponds to a flat direction in field space lifted 
by non-renormalizable interactions. Moreover, the pa- 
rameter of MSSM is generated from the PQ scale. 

We feel that the standard curvaton hypothesis |3, |3 , 
which insists that the total curvature perturbation origi- 
nates solely from the curvaton, can also be quite restric- 
tive and not so natural. Indeed, in accordance to this 
hypothesis, one needs to impose [iJI an upper bound on 
the inflationary scale in order to ensure that the pertur- 
bation from the inflaton is negligible. This bound can be 
quite strong especially if the slow-roll parameter e (see 
e.g. Ref. happens to be very small, which holds in 
many cases. In generic models, one would expect that 
all the scalar fields which are essentially massless during 
inflation contribute to the total curvature perturbation. 
So, in the presence a PQ field, which can be kept [ll| 
light during the relevant part of inflation, it is natural to 
assume that the adiabatic density perturbation is partly 
due to this field and partly to the inflaton. 

There is yet another reason for abandoning the strict 
curvaton hypothesis. The recent measurements on the 
CMBR by the Wilkinson microwave anisotropy probe 
(WMAP) satellite have considerably strengthened 
0, 0, 113 the bound on the isocurvature perturbation 
which was obtained [U 112 by using the pre- WMAP 
CMBR data. As a consequence, the viability of many 
curvaton models is in doubt. However, allowing a sig- 
nificant part of the total curvature perturbation in the 
universe to originate from the inflaton, we can hopefully 
relax the tension between these models and the present 
WMAP data without losing the main advantage of the 
curvaton hypothesis, which is that it facilitate the con- 
struction of viable inflationary models (see Ref. t23j for 
recent investigations of this possibility). 

The PQ curvaton model of Ref. |0| predicts an isocur- 
vature perturbation of mixed correlation with the cur- 
vature perturbation. The extended set of pre- WMAP 
CMBR and other data which was used ^ to restrict the 
isocurvature perturbation in the model of Ref. led to 
the exclusion of a major part of the available parameter 
space. Including the more restrictive recent WMAP mea- 
surements, the allowed parameter space will certainly be 
further reduced. It is, moreover, quite possible that the 
model is even totally excluded by these new data. So, 
the departure from the strict curvaton hypothesis may 
prove to be vital for this particular curvaton scheme. 

In this paper, we will extend the PQ curvaton model of 
Ref. by embedding it into the concrete SUSY grand 
unified theory (GUT) model studied in Ref. [1^, which 
is based on the left-right (LR) symmetric gauge group 
Glr = SU(3V X SU(2)i X SU(2)fl x \]{1)b-l- This 
model leads naturally to standard SUSY hybrid in- 



flation After the end of infiation, the inflaton 

performs damped oscillations about the SUSY vacuum 
and eventually decays into right handed neutrino super- 
fields reheating the universe. The subsequent decay of 
these superfields to a lepton and an electroweak Higgs 
superfield generates [l^ a primordial lepton asymmetry 
|26j | which is then partly converted into baryon asym- 
metry by non-perturbative electroweak sphaleron effects. 
The observed baryon asymmetry of the universe (BAU) 
can then be easily reproduced [13| in accord with the 
data on neutrino masses and mixing. At reheating, grav- 
itinos are also produced thermally. They decay in the 
late universe leading to hghtest sparticles (LSPs), which 
contribute to the cold dark matter (CDM) in the uni- 
verse. For simplicity, we assume that this is the only 
source of relic LSPs neglecting their direct thermal pro- 
duction. Due to the presence of the PQ symmetry, our 
model contains axions which come into play at the QCD 
phase transition and can also contribute to CDM. 

The PQ field of our model can acquire a super-horizon 
spectrum of perturbations from inflation provided that 
it is effectively massless during the relevant part of in- 
ffation. It can thus act as curvaton contributing to the 
total curvature perturbation together with the inflaton. 
We study the evolution of the PQ field during and after 
infiation by including corrections |2i IZ^ to the PQ 
potential which originate from the SUSY breaking in the 
early universe caused by the presence of a finite energy 
density. We assume that these corrections are (some- 
what) suppressed, which is [13 indeed the case if specific 
Kahler potentials are used. 

The requirement that the PQ field is essentially mass- 
less during infiation yields ^3|> f^i' given values of the 
other parameters, an upper bound on the possible values 
of this field at the end of inflation. Moreover, it im- 
plies that, as inflation terminates, the PQ field emerges 
|Tl| with negligible velocity. There is also a lower bound 
on the value of the PQ field at the end of infiation be- 
low which the classical equation of motion during infia- 
tion for the mean value of this field in a region of fixed 
size somewhat bigger than the size of the de Sitter hori- 
zon ceases to be valid. This is due to the fact that 
the quantum perturbations of the PQ field from infiation 
overshadow its classical kinetic energy density. We will 
exclude this quantum regime since the calculation of the 
spectral index of the partial curvature perturbation from 
the curvaton field in this regime is not so clear. 

The values of the PQ field at the end of infiation which 
are not excluded by the above considerations can be clas- 
sified according to whether they lead to the PQ vacua 
or the trivial (false) vacuum. We find that, generically, 
there exist two separate bands of such values leading to 
the correct (PQ) vacua. One of them lies at relatively low 
values of the curvaton field at the end of inflation, while 
the other lies at values which are considerably higher. In 
all other cases, the system ends up in the wrong (trivial) 
vacuum and thus the corresponding values of the PQ field 
at the end of infiation must be excluded. Our numerical 
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findings show that the (approximate) COBE constraint 
on the CMBR can be satisfied only within the upper al- 
lowed band. This constraint receives contributions not 
only from the curvature perturbation but also from the 
isocurvature one and the cross correlation of the two. 
Note, however, that this constraint is quite approximate 
and can be considered only as an indicative criterion. 

The amplitude and spectral index of the partial cur- 
vature perturbation from the inflaton are calculated by 
employing the analysis of Ref. ^ slightly modified to al- 
low for the possibility that the slow-roll conditions are 
violated and, thus, hybrid inflation ends before reach- 
ing the instability point on the inflationary trajectory. 
The partial curvature perturbation from the curvaton is 
treated in a more accurate way than in Ref. [Tlj | . In par- 
ticular, the evolution during inflation of the perturbation 
acquired by the PQ field from inflation when our present 
horizon scale crossed outside the inflationary horizon is 
considered. It is described by the classical equation of 
motion for this fleld in the slow-roll approximation. Solv- 
ing this equation, we can find the perturbation in the 
value of the curvaton field at the end of inflation. This 
same calculation yields the spectral index for the cur- 
vaton too. For any given value of the PQ field at the 
termination of inflation, we take the perturbed value too 
and follow the subsequent evolution of both these fields 
until the time of the curvaton decay. This yields the 
amplitude of the partial curvature perturbation from the 
curvaton. The total curvature perturbation is then given 
by the appropriate weighted sum of these two uncorre- 
lated perturbations. 

As mentioned already, the baryons and the LSPs in 
our model originate from reheating. They thus inherit 
the partial curvature perturbation of the oscillating and 
decaying inflaton, which is different from the total curva- 
ture perturbation due to the presence of the curvaton. As 
a consequence, the baryons and LSPs acquire an isocur- 
vature perturbation of mixed correlation with the total 
curvature perturbation. The CDM in our model contains 
also axions carrying an isocurvature perturbation which 
is uncorrelated with the perturbations from the inflaton 
and the curvaton. The amplitude and spectral index of 
this isocurvature perturbation is evaluated by following 
the analysis of Ref. We see that, in our model, the 
correlation of the total adiabatic and isocurvature per- 
turbations is mixed. 

For given values of all the other independent param- 
eters, we take a grid of values of the curvaton field at 
the end of infiation and the amplitude of partial curva- 
ture perturbation from the inflaton which cover the up- 
per or the lower allowed band. We calculate the total 
CMBR angular power spectrum for each point in this 
grid. The predictions from each band are confronted 
with the CMBR temperature (TT) and temperature- 
polarization (TE) cross correlation angular power spectra 
from the first-year WMAP observations as well as the 
CMBR data on smaller scales from the arcminute cosmol- 
ogy bolometer array receiver (ACBAR) |30l IsH and the 



cosmic background imager (CBI) experiments. We 
then study the implications of the resulting restrictions 
on the various parameters of the model. We also em- 
ploy Bayesian model testing to compare our model with 
the standard pure inflaton single-field inflationary model 
with scale-invariant perturbations. We are particularly 
interested to see whether the data favor the presence of 
a non-vanishing curvaton contribution to the adiabatic 
perturbation. 

The paper is organized as follows. In Sec.^ we outline 
the salient features of our LR SUSY GUT model which 
solves the strong CP and /i problems via a PQ symme- 
try and leads to the standard version of SUSY hybrid 
inflation. The evolution of the PQ field during and af- 
ter inflation as well as its flnal decay into light particles 
are sketched in Sec. IIIII Section IIVI is devoted to the 
evaluation of the total curvature perturbation which, in 
our case, is partly due to the inflaton and partly to the 
curvaton. In Sec. we estimate the isocurvature per- 
turbations in the relic density of the baryons, the LSPs 
and the axions. The total CMBR angular power spec- 
trum predicted by our model is discussed in Sec. lVIl Our 
numerical calculation and results are presented and dis- 
cussed in Sec. I VIII and our conclusions are summarized 
in Sec. IVIIII Finally, in the Appendix, we review some 
useful concepts and results from Bayesian statistics. 



II. THE LEFT-RIGHT SUSY GUT MODEL 

We wfll adopt here the SUSY GUT model of Ref. ^ 
(see also Ref. 33]) which is based on the LR symmet- 
ric gauge group Glr- The SU(2)l doublet left handed 
quark and lepton superfields are denoted by qi and U 
respectively, whereas the SU(2)fl doublet antiquark and 
antilepton superfields by qf and respectively (i— 1,2,5 
is the family index). The electroweak Higgs super- 
field h belongs to a bidoublet (2, 2)o representation of 
SU(2)l X SU(2)fl X V{1)b-l- 

The breaking of Glr to the standard model (SM) 
gauge group Gsm, at a superheavy scale M ^ 10^^ GeV, 
is achieved through the superpotential 

SWi^KSilj^lj^^M^), (1) 

where Z^, is a conjugate pair of SU(2)/f doublet left 
handed Higgs superfields with B — L charges equal to 
1,-1 respectively, and S" is a gauge singlet left handed 
superfield. The dimensionless coupling constant n and 
the mass parameter M can be made real and positive by 
suitable rephasing of the fields. The SUSY minima of 
the scalar potential lie on the D-flat direction = at 
(5)=0, |(Z^)| = |(f^)|=Af. 

The model also contains two extra gauge singlet left 
handed superfields N and N for solving [13 the ^ prob- 
lem of MSSM via a PQ symmetry T^, which also solves 
the strong CP problem. They have the following super- 
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potential couplings: 



5W2 



2mp 



21,2 



2mi 



(2) 



where A and (3 are dimensionless parameters, which can 
be made real and positive by an appropriate redefinition 
of the phases of the superfields and mp ~ 2.44 x 10^* GeV 
is the reduced Planck mass. 

In addition to Glr, the model possesses three global 
U(l) symmetries, namely an anomalous PQ symmetry 
U(l)pQ, a non-anomalous R symmetry U(l)i?, and the 
baryon number symmetry U(1)b. The PQ and R charges 
of the various superfields are 

PQ: q-,l',S,lj,J'H{0), h,N{l), q,l,N{-iy, 
R: hJj,Jj,,N{0), 9,g^^,^^7V(l/2), S{1). (3) 

Note that global continuous symmetries such as our PQ 
and R symmetry can effectively arise [s^ j from the rich 
discrete symmetry groups encountered in many compact- 
ified string theories (see e.g. Ref. H^)- 

It is well known that the superpotential in Eq. leads 
[Tsl IT^ naturall y t o the standard SUSY realization of 
hybrid inflation |l7l| . In particular, the scalar potential 
which is derived from it possesses a built-in classically 
flat valley of minima at = = and for \S\ greater 
than a critical (instability) value Sc = M. This val- 
ley can serve as inflationary path. Indeed, the constant 
tree-level potential energy density k^M"^ on this path can 
cause exponential expansion of the universe. Moreover, 
since this constant energy density breaks SUSY, there are 
important radiative corrections _16J which provide a log- 
arithmic slope along the inflationary trajectory necessary 
for driving the system towards the vacua. 

It should be noted that the SUSY GUT model con- 
sidered here does not predict the existence of topological 
defects such as magnetic monopoles or cosmic strings. 
In the opposite case, these defects would have been co- 
piously produced |36j| at the end of hybrid inflation. 
The overproduction of magnetic monopoles, in partic- 
ular, would have led to a cosmological catastrophe and a 
modification '36', '37^ of the standard realization of SUSY 
hybrid inflation would be needed to avoid this problem. 
This happens in higher gauge groups such as the Pati- 
Salam group, which predicts the existence of monopoles 
carrying two units of Dirac magnetic charge |38l | . Cosmic 
strings, on the other hand, which are generically present 
in many GUT models liDl > would contribute to the 
cosmological perturbations leading to extra restric- 
tions on the parameters of the model. The reason that 
our model does not predict cosmic strings is that the 
Glr breaking is achieved by a conjugate pair of SU(2)ij 
doublets with B — L = 1,-1 which also break the Z2 
subgroup of U(1)_B-L- This Z2, which does not belong 
to GsM, would have been left unbroken if, alternatively, 
we had used a pair of SU(2)i^ triplets with B — L = 2,—2 
for this breaking. This would have led to the presence of 
Z2 cosmic strings (compare with the Z2 cosmic strings 
encountered in the SO(IO) GUT model of Ref. [H). 



The part of the tree-level scalar potential which is rel- 
evant for the PQ symmetry breaking is derived from the 
superpotential coupling XN'^N^ /2mp in Eq. Q and, af- 
ter soft SUSY breaking mediated by minimal supergrav- 
ity (SUGRA), is given by [H 
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(4) 
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where TO3/2 ~ 1 TeV is the gravitino mass and A is the 
dimensionless coefficient of the soft SUSY breaking term 
corresponding to the superpotential term XN'^N^/2mp. 
Here, the phases a, (p and if of A, N and N are taken to 
satisfy the relation a + 2ip + 2(p = it and \N\, \N\ are as- 
sumed equal, which minimizes the potential. Moreover, 
rotating TV on the real axis by an appropriate R transfor- 
mation, we defined the canonically normalized real scalar 
PQ field = 2N. For |^| > 4, this potential has a local 
minimum at d> — and absolute minima at 



^fa^^ [\A\ + vW^j ^3/2mp (5) 

with fa (> 0) being the axion decay constant, i.e. the 
symmetry breaking scale of U(1)pq. Substituting this 
vacuum expectation value (VEV) into the superpotential 
coupling /3N'^h'^ /2mp in Eq. (|2Jl, we obtain a /i term with 



Amp 



■m3/2, 



(6) 



as desired il^]. Note that the potential Vpq in Eq. Q 
should be shifted by adding to it the constant 



Vb 



X 



12 



12 - 12j -24 
so that it vanishes at its absolute minima. 



ml/^mp, (7) 



III. THE PQ FIELD IN THE EARLY UNIVERSE 

In the early universe, the PQ potential can acquire 
sizable corrections from the SUSY breakin g ca used by 
the presence of a finite energy density p^l2^ l28l| . In 
particular, during inflation and the subsequent inflaton 
oscillations, SUSY breaking is transmitted to the PQ sys- 
tem via its coupling to the inflaton given by the SUGRA 
scalar potential. The resulting corrections, whose scale 
is set by the Hubble parameter H, dominate over the 
contributions from hidden sector SUSY breaking as long 
as H ^ '713/2- For simplicity, we will ignore the A term 
type corrections l^^. To leading order, we then just ob- 
tain a correction Sm^ to the mass^ of the curvaton. For 
a general Kahler potential, Jm^ ~ with either sign 
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possible. However, for specific (no-scale like) Kahler po- 
tentials, it might be (partially) cancelled ^9\. Assuming 
that Sm"^ > 0, we write 

6ml = j^H^ (8) 

where 7 can have different values during inflation and in- 
flaton oscillations. Actually, we must assume that, dur- 
ing inflation, 7 <C f so that the PQ field qualifies as 
an effectively massless field which acquires perturbations 
from inflation and thus can act as curvaton. Fortunately, 
the cancellation of Sm^ during inflation can, in principle, 
be "naturally" arranged to be exact (see fourth paper in 
Ref. ). So, for simplicity, we could take 7 = during 
inflation. On the other hand, large values of 7 after the 
end of inflation would generically lead ji^l to a drastic 
reduction of the density fraction of the PQ field, which 
thus again would become unable to play the role of cur- 
vaton. In view of the fact that, in contrast to the case of 
inflation, it is not so easy to achieve exact cancellation 
of Srn^ during inflaton oscillations, we will only assume 
that, after the termination of inflation, 7 is somewhat 
suppressed, say ~ O.f (or smaller). 

After reheating, the universe is radiation dominated 
and, thus, H ~ l/2t < l/2trch = rinfi/2, where t is the 
cosmic time and trch — Prjj the time at reheating with 
Pinfl being the inflaton decay width. It is easily seen 
that, in this case, H ^ TO3/2 as a consequence of the 
gravitino constraint (Trch < 10^ GeV) [43l| on the reheat 
temperature T^ch, which is given by [3 

Trch = f TT^V (rinfimp)^ (9) 

where is the effective number of degrees of freedom 
(5* = 228.75 for the MSSM spectrum). Thus, the SUSY 
breaking effects from the finite energy density in the uni- 
verse are subdominant compared to the hidden sector 
SUSY breaking effects, whose scale is set by 7713/2- 

The PQ potential can acquire temperature corrections 
too. During the era of inflaton oscillations, they origi- 
nate from the new radiation which emerges from the 
decaying inflaton. It has been shown TTl, however, that 
these corrections are overshadowed by the SUGRA ones 
which, in the latest stages of this era, are, in turn, over- 
shadowed by the terms from hidden sector SUSY break- 
ing. After reheating, the temperature corrections are less 
important than the ones from the hidden sector as argued 
in Refs. js^l and 0- ^he temperature corrections to 
the PQ potential can be ignored throughout. 

We see that, in the early universe, the effective scalar 
potential for the PQ field can be taken to be 

^PQ - ^PQ + ^7'ff'0' + Vo. (10) 

The full effective scalar potential V which is relevant for 
our analysis here is obtained by adding to VpQ the poten- 
tial for standard SUSY hybrid inflation (see e.g. Ref. [^). 



The evolution of the field (j) is generally governed by 
the classical equation of motion 

4> + 3H4,+ V' = 0, (11) 

where overdots and primes denote derivation with respect 
to the cosmic time t and the PQ field (f> respectively. In 
particular, Eq. ((TT|l describes [4g the evolution during 
inflation of the mean value of (/) in a comoving region 
larger than the inflationary horizon. Actually, this equa- 
tion starts to be valid a short time after this region crosses 
outside the de Sitter horizon. The mean value of 0, how- 
ever, in a region of fixed size somewhat bigger than the 
(almost) constant size of the de Sitter horizon satisfies 
|47| this equation during inflation provided that it ex- 
ceeds a certain value (j>Q given by 

V'^^^, (12) 

where Hmfi is the almost constant Hubble parameter dur- 
ing inflation. Therefore, if we require that the value 0/ 
(considered positive without loss of generality) which is 
taken at the end of inflation by the mean </i in a region of 
fixed size somewhat bigger than exceeds (j)Q , we can 
be sure ^] that the classical evolution equation holds for 
this mean field until the end of inflation. For values of 
the mean in a region of fixed size somewhat bigger than 
^inft "^liich are smaller than about 0q , the random walk 
executed |48j by this mean field due to the quantum per- 
turbations from inflation cannot be neglected and may 
overshadow (47,J its classical motion. Thus, in this case, 
the classical equation of motion during inflation for this 
mean (j) ceases to be valid. For reasons to be discussed 
later, we will not consider in our analysis values of 0/ 
which lie in the quantum regime, i.e. which are smaller 
than (j>Q (see Sees. II Vl and IVIl"Xl . It should be pointed 
out in passing that the requirement of complete random- 
ization of the mean </) in a region of fixed size somewhat 
bigger than as a consequence of its quantum pertur- 
bations from inflation implies 49] an even more stringent 
bound on this mean (j> given by y < ^^kifl- 

Moreover, as explained in the next section, we will have 
to study only values of for which is a slowly rolling 
field during the relevant part of inflation (i.e. during 
at least the last 50 — 60 e- foldings). It has been shown 
PH that, in this case, the PQ fleld (p emerges at the 
end of inflation with negligible velocity (i.e. derivative 
with respect to cosmic time). Its subsequent evolution 
during the matter dominated era of damped inflaton os- 
cfllations is given by Eqs. ifTUI) and ((TT|l with H = 2 /'it. 
One finds [ill that, depending on the value of 0/, the PQ 
system eventually enters into a phase of damped oscilla- 
tions about either the trivial (local) minimum of T^pq at 
(f) — Q OY one of its PQ (absolute) minima at = ±/a- 
Of course, values of 0/ leading to the trivial minimum 
must be excluded. 

The damped oscillations of the PQ field continue even 
after reheating, where H becomes equal to l/2t. Finally, 
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this field decays via the second couphng in the superpo- 
tential of Eq. Q into a pair of Higgsinos provided that 
their mass does not exceed half of the mass of the PQ 
field (see Ref. [ll|) 
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(13) 



which is independent of the parameter A. The decay time 
of the PQ field is = PI^, where P^ is its decay width, 
which has been found [ij to be given by 



9-/ 

STTlTlp 



(14) 



Note that the coherently oscillating PQ field could evap- 
orate 50] as a result of scattering with particles in the 
thermal bath before it decays into Higgsinos. However, 
one can show JJ_j that, in the model under discussion 
here, this does not happen. 



unity at the present cosmic time. We define the following 
gauge invariant quantities: 



c 



p 

= -C - H{B - a^v), 
= -C ~ H (^B - a^E^ 



(16) 

(17) 
(18) 



Here, p = —Tq'^ is the total energy density in the uni- 
verse with r^" being the energy momentum tensor, 6p — 
—STq^ is the total density perturbation, and v describes 
the spatial perturbation in the 4-velocity u'* of an ob- 
server comoving with the total fluid, i.e. = —Su^/vP. 
The variable C, represents the curvature perturbation on 
hypersurfaces of uniform density, 7?.rad is the curvature 
perturbation in the (total matter) comoving gauge (up 
to the sign), while $ is the Bardeen potential, which is 
the curvature perturbation (up to the sign) in the lon- 
gitudinal gauge. These three quantities are related by 



IV. THE CURVATURE PERTURBATION 

We will consider here only values of 0/ for which the 
PQ field (f) is effectively massless, i.e. V" < H^, dur- 
ing (at least) the last 50 — 60 infiationary e-foldings so 
that it receives a super-horizon spectrum of perturbations 
from infiation and can act as curvaton. This requirement 
also guarantees that (j) is slowly rolling during the rele- 
vant part of inflation. The perturbation 64> then evolves 
at subsequent times and, when </> settles into damped 
quadratic oscillations about the PQ vacua, yields a sta- 
ble perturbation in the energy density of this field. Af- 
ter the PQ field decays, this perturbation is transferred 
to radiation, thereby contributing to the total curvature 
perturbation. On the other hand, the radiation, which 
originates from the inflaton decay, could carry a curva- 
ture perturbation prior to the curvaton decay too. It 
actually inherits the curvature perturbation of the infla- 
ton. Contrary to the standard curvaton hypothesis 
we make here the more natural assumption that this per- 
turbation is non-zero and, thus, also contributes to the 
total curvature perturbation. 

The scalar part of the metric perturbation for a flat 
universe can be written (using the notation of Ref. [5lj|). 
in all generality, as follows (for reviews of the gauge 
invariant theory of cosmological perturbations, see e.g. 
Ref. [13 ): 

Sg^.ydx^dx" = -2Adt^ + 2aB^,dtdx^ 

W (2C% + E^,j) dx'dx\ (15) 

where fi,!^ ~ 0,1,2,3 and i,j = 1,2,3. Here, x° — t is 
the physical time, x^ (i ~ 1, 2, 3) are the comoving spatial 
coordinates, = dY/dx^ [i — 1,2,3) and 5ij denotes 
the Kronecker delta. The dimensionless parameter a is 
the scale factor of the universe, which is normalized to 



c 



ad 



2p 



'd{p + p) \Ha 



(19) 



where we have used the time-time component of the Ein- 
stein equation and p is the total pressure of the universe 
(i.e. Tij = p6ij). On super-horizon scales, k <C Ha, the 
second term on the right hand side of this equation is 
negligible, and we thus have C = —Ti-rad- 

Using Eq. ((TCI) , the total curvature perturbation [s^l 
in the flat slicing gauge (defined by setting C — E = 
in Eq. (^Sj) is given by 



c 



Sp 



3{p + p)' 

After the curvaton decay, it becomes ^3 

c = (1 - m + fCc, 



(20) 



(21) 



where Ci = ^Pr/^Pr and Qc = Sp^/ip^ are the partial cur- 
vature perturbations on spatial hypersurfaces of constant 
curvature from the inflaton and the curvaton respectively 
at the curvaton decay with pr and p^ being the radiation 
and (f> energy densities respectively, and 5pr and 5p^ the 
corresponding perturbations. Also, 



3p0 



3p0 -I- Apr 



(22) 



evaluated at the time of the curvaton decay. Here, we 
assume that the amplitude of the oscillating has been 
sufficiently reduced so that the potential can be approx- 
imately considered as quadratic. Actually, as shown in 
Ref. |42|| . this must necessarily happen before the curva- 
ton decays. The oscillating curvaton field then behaves 
like pressureless matter and Cc = 250o/3</'O) where is 
the amplitude of the oscillations and <50o the perturba- 
tion in this amplitude originating from the perturbation 
in the value 0/ of at the end of inflation. 



7 



The comoving curvature perturbation 72.iad , for super- 
horizon scales, is given by 

7^rad - f— j a, + fAj — \ a„ (23) 

where k is the comoving (present physical) wave num- 
ber, Hq is the present value of the Hubble parameter 
and ai, dc are independent normalized Gaussian random 
variables. Also, Ai and Ac are, respectively, the ampli- 
tudes of —Ci and — at the present horizon scale (i.e. at 
k = Hq), and Vi and Vc are the spectral tilts of the infla- 
ton and curvaton respectively, which are related to the 
corresponding spectral indices rii and ric hy rii = 1 + 2vi 
and Tie — \ + 2fc. We do not consider running of the 
spectral indices, since this is negligible in our model. 

The amplitude Ai of the partial curvature perturbation 
from the inflaton is found '24| to be 



A, = 



M 
trip 



^/vQ^Aixlr' (24) 



with 



A{z) = iz + l) ln(l + z-i) + (z - 1) ln(l - z'^) , (25) 



dz 



A(z)-\ > 



(26) 



Here, Nq is the number of e-foldings suffered by our 
present horizon scale during inflation, xq — Sq/M with 
Sq being the value of jS*! when our present horizon scale 
crosses outside the inflationary horizon, and Xf = Sf/M 
with Sf being the value of \S\ at the end of inflation. 

In our model, the slow-roll parameters for the inflaton 
as functions of 1 5*1 are given by Q 



K^mp 
8n^M 



zK{zf, 



(27) 



KAnMJ 
+ (3z - l)ln(l - z-^) 



(3z + f)ln(f + Z-1) 



(28) 



where z = x'^ with x = \S\/M. In the presence of the 
curvaton, however, one can show that the slow-roll con- 
ditions (for 7^0) take the form e,\rii\ < 1, where 



H 



(29) 



with 



and Xf is given by the value of x for which these condi- 
tions are saturated. Actually, as it turns out, Xf corre- 
sponds to ?7i = — 1. For K ^ 1, the slow-roll conditions 
are violated only extremely close to the critical point at 
X = 1 (15*1 = Sc)- So, inflation continues practically until 
this point is reaches and, following Ref. [2J|, we can put 
Xf = 1 in Eq. (|26|l . However, for larger values of the 
parameter k, inflation can terminate well before reaching 
the instability point. 

Finally, k, Nq are given by Q 



2tt M 

/No "mp 



Nc 



55.9 



2 K2M 

3 1015 GcV 



1 1 ^rch 

3 ^ 109 GeV 



(31) 



(32) 



(for Hq ~ 72 km sec~i Mpc~^). The spectral index for 
the inflaton is = 1 — 6e -I- 2'qi (for 7^0), where e and 
rji are evaluated at the time when our present horizon 
scale crosses outside the inflationary horizon. Note that 
e enters in this formula, not ei as in the case of pure infla- 
ton. However, ec is normally much smaller than Ci which, 
in turn, is negligible compared to jT^ij. As a consequence, 
~ 1 -I- 2r]i. 

We now calculate the amplitude Ac of the partial cur- 
vature perturbation from the curvaton (p. This originates 
from the perturbation 5(j)^ = (-ff*/27r)ac acquired by (j) 
from inflation when our present horizon scale crosses out- 
side the de Sitter horizon (i7* is the inflationary Hubble 
parameter at that moment). In order to find the evo- 
lution of this perturbation during the subsequent part 
of inflation and estimate its value 5<j)f at the end of in- 
flation, we must consider the equation of motion for 
during inflation (see Eq. (|ll(l l. In the slow-roll approx- 
imation, which is assumed to hold for the curvaton too, 
this equation reads 

iH^+V = Q. (33) 
Taking a small perturbation (50 of 0, Eq. H33|) gives 



?,H6^ + 2,H'6(j)^ + V"6(j> = 0. 



(34) 



Substituting from Eq. (|33ll and using the Friedmann 
equation SH'^rrip = V, Eq. becomes 



' + H{-ec + Vc)S(t) = 0, 



where 



' V ' 



(35) 



(36) 



Integration of Eq. (|35|1 from the cosmic time when 
our present horizon scale crossed outside the inflationary 
horizon until the end of inflation (at time t/) yields 



1 2 fv 

tr. = -Top , y 



(30) 



2^ 



Oc exp 



Nq 



{tc - ilc)dN, 



(37) 
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(38) 



where we used the relation dN = —Hdt for the num- 
ber of e-foldings N{k) — Nq + ln(iJo/fc) suffered by the 
scale which corresponds to the comoving wave number k 
during hybrid inflation. 

For each value (pf (> 0) of the curvaton field at the end 
of inflation, we construct the perturbed fleld (jjj + 6(j)f. 
We then follow the evolution of 0/ and (j>f + S(t)f until 
the time of the curvaton decay and evaluate the value 
of Sp^/p^ at this time. The amplitude Ac of the partial 
curvature perturbation from the curvaton is given by 

1 5p^ 
3 P4, ' 

We have found numerically that the perturbation S(j)o in 
the amplitude of the oscillating curvaton at is propor- 
tional to S(pf. So Cc has the same spectral tilt as 6(j)f, 
which can be found from Eq. H37|) : 

din Ac 

I'c = ^^^^ = -e-(c + ric, (39) 

where we used the relation dink — Hdt, and e, Cc and 
77c are evaluated at t^. (Note that a similar formula has 
been derived in the first paper in Ref. but without 
the — ec term in the right hand side.) The spectral index 
for the curvaton is then Uc = 1 — 2e — 2ec -I- 277c which, 
in most cases, reduces to nc — 1 + "irjc since we typically 
have Ci, tc < \ric\- 

It should be pointed out that, in deriving Eq. (|39|) . 
we assume that, during inflation, the mean value of 
in any region of fixed size (somewhat bigger than) 
is initially the same and follows the classical equation 
of motion (Eq. This mean field sets the (initial) 

value of the mean </> in any comoving region at the time 
this region exits the de Sitter horizon. Thus, if it satis- 
fies the above requirements, we can be sure that, at any 
given time during inflation, the resulting mean (f> in each 
comoving region that already exited the horizon is prac- 
tically independent from the size of this region. So the 
mean in a flxed region of size H^_^ or in any comov- 
ing volume (after horizon exit) is described by the same 
(single-valued) function of the cosmic time and evolves 
classically (in fact, it rolls down slowly in our case). The 
derivation with respect to time of the logarithm of the 
amplitude of 54>f, which is given by Eq. (|37|l . is then 
straightforward leading to Eq. On the contrary, if 

the mean fleld in a flxed region of size somewhat big- 
ger than executes a random walk with step of am- 
plitude 7Jinfl/27r per Hubble time, the calculation of the 
spectral tilt becomes less clear. So, we decided to exclude 
the quantum regime (i.e. the region </>/ < 4>q) from our 
analysis (see Sec. I VII It is true, however, that it is by 
no means necessary to avoid the random walk behavior 
at all times during inflation. Indeed, only cosmological 
scales corresponding to a few e-foldings after the exit of 
our present horizon scale from the de Sitter horizon are 
relevant. For simplicity though, we exclude all the quan- 
tum regime so that no random motion of the mean in a 
region of size is encountered during inflation. This, 
as we will see, has no influence on our results. 



V. THE ISOCURVATURE PERTURBATION 

After the termination of inflation, the inflaton per- 
forms damped oscillations about the SUSY vacuum and 
eventually decays into light particles reheating the uni- 
verse. At reheating, gravitinos are thermally produced 
besides other particles. They only decay, though, well 
after the big bang nucleosynthesis (BBN) since they have 
very weak couplings. Each decaying gravitino yields one 
sparticle subsequently turning into the LSP, which is sta- 
ble. These LSPs survive until the present time contribut- 
ing to the relic abundance of CDM in the universe. For 
simplicity, we assume that the thermally produced LSPs 
can be neglected, which holds in many cases. So, all 
the relic LSPs come solely from the decaying gravitinos. 
Baryons can be produced via a primordial leptogenesis 
|26l | which can occur (2^ at reheating. 

We see that both the LSPs and the baryons origi- 
nate from reheating. Their partial curvature perturba- 
tions, ^LSP and Cs respectively, should thus coincide with 
the partial curvature perturbation of the radiation which 
emerges from the inflaton decay, i.e. 



Clsp — Cs — Ci- 



(40) 



The isocurvature perturbation of the LSPs and the 
baryons is then given by 



5, 



LSP+S 



3(C: 



LSP+B 



- C) = 3/(C. - Cc), (41) 



where Clsp+s = Clsp = Cs is the partial curvature 
perturbation of the LSPs and the baryons and we used 
Eq. (|^ . Here, we assume that the curvature perturba- 
tion in radiation (C7) practically coincides with the total 
curvature perturbation. This corresponds to a negligible 
neutrino isocurvature perturbation, which is 10] the case 
provided that, as in our model, leptogenesis takes place 
well before the curvaton decays or dominates the energy 
density. Applying the definitions which follow Eq. I|23f) . 
Eq. iglj takes the form 

SLSP+B = -ifAJ—\ d, + 3fAc(—j ac. (42) 

Our model contains axions which can also contribute to 
the CDM of the universe. They are produced at the QCD 
phase transition, which occurs well after the curvaton 
decay. They carry an isocurvature perturbation, which is 
completely uncorrelated with the curvature perturbation 
and can be written as 



(43) 



where Aa is its amplitude at the present horizon scale, 
Va is the corresponding spectral tilt (yielding the spectral 
index = 1 -I- 2i'a) and So is a normalized Gaussian 
random variable which is independent from ai and ac- 
The amplitude Aa is given by 



A„ 



(44) 



9 



where 9 is the so-called initial misalignment angle, i.e. 
the phase of the complex PQ field during (and at the 
end of) inflation and <i)s. is the value of (/) at t*. In our 
case, the angle 9 lies 11] in the interval [— 7r/6, 7r/6] with 
all values in it being equally probable. It is determined 
by considering the total CDM abundance ricDNi/i^ which, 
in our model, is the sum of the relic abundance 15411 







0074 f "^^^P \ (-^ 
V200 GeV/ I 109 Gc 



GeV j 



(45) 



of the LSPs coming from the gravitino decays and the 
relic axion abundance l55l 



fa 



1012 GeV 



1.175 



(46) 



Here ^Ij — Pj/ Pc with pj being the present energy density 
of the jth species and pc the present critical energy den- 
sity of the universe. Furthermore, mLSP is the LSP mass 
and Hq = lOO/i km sec~^ Mpc~^ the present Hubble pa- 
rameter. In the numerical calculation of M and the am- 
plitudes Ac and Aa for given Ai and (j)f (see Sees. IVIl"XI 
and IVII^ . we will fix /i ~ 0.72, which is its best-fit 
value from the Hubble space telescope (HST) [E^. (In 
the full Monte Carlo (MC) CMBR an alysis, h owever, we 
do allow h to vary as detailed in Sec. IVIID in This ap- 
proximation has very little influence on the accuracy of 
the results (see the remarks at the end of Sec. IVIIBIl . 
Note that, in deriving the estimate for Vlah? in Eq. (|46|l . 
we applied the simplifying assumptions of Ref. [ij • 

The spectral tilt Va is evaluated by observing that the 
potential of the axion field remains flat until the QCD 
transition is reached. So, there is no evolution of 9 and 
its perturbation 59 = (Tf* /27r (^*)aa after crossing outside 
the inflationary horizon and until the onset of axion os- 
cillations. The axion isocurvature perturbation, which is 
[m equal to 259/9, depends on the scale only through 
7Jh./0*. We find 



TOP V 



--TOP 



TOp 



^^/2^, (47) 



where Eqs. (|30() and (|33|l were used, and the slow-roll 
parameters e and tc are evaluated at t^,. In view of the 
fact that e is normally negligible, Eq. (|47|l reduces to 
Va — TOp-\/2ec/'/>* and Ua — 1 + 2m-p^2tcl ip*. 

Combining Eqs. H42I) and l|43|l . we find that the total 
isocurvature perturbation is [llj 

<, ^LSP+5 g. A_^<^ <: (AQ\ 

where we used the definitions J^lsp+b = f^LSP + and 
= Pm/Pc = ^^LSP+B + with Pm being the total 
matter density at present. 



VI. THE CMBR POWER SPECTRUM 

In order to calculate the expected total CMBR an- 
gular power spectrum Ce in our model, we must first 



specify the various contributions to the amplitude of the 
total adiabatic and isocurvature perturbation as well as 
the cross correlation between these two perturbations. 
In the following, all the amplitudes are referred to the 
pivot scale kp, for which we use the customary value 
kp — 0.05 Mpc~^ (53. We thus define the amplitudes 
of the partial curvature perturbations from the inflaton 
{Ap j) and the curvaton {Ap^c), and the amplitude of the 
isocurvature perturbation in the axions (^p,q) at fc = fcp: 



Av 



A,, 



kp 



Av 



Ar 



kp 



Ap,a^Aa[^^" 



(49) 



where the amplitudes Ai , Ac and Aa at k ~ Hq are given 
in Eqs. and igU) respectively. 

From Eq. we find that the amplitude squared of 
the adiabatic perturbation at the pivot scale is given by 



— (7?.iad'^rad) = Ri + R^, 



(50) 



where 7?.rad is evaluated at A; = fcp, and the inflaton 
{Rf) and curvaton (R^) contributions to this amplitude 
squared are 



i?2 = (1 - /)2a|., and Rl^fAl 



(51) 



The curvaton fractional contribution to the amplitude of 
the adiabatic perturbation is defined as follows: 



pad _ 



(52) 



The amplitude squared of the isocurvature perturba- 
tion at fcp is found from Eq. 14811 to be 



S — (iSi-ad^rad) — + Sc + Sa, (53) 



where 



St = 9/ 



2 / ^^LSP+S \ .2 



^p,ij 



(54) 

are, respectively, the inflaton, curvaton and axion contri- 
butions to this amplitude squared. 

The cross correlation between the adiabatic and isocur- 
vature perturbation at the pivot scale fcp is 



where 



C — (TJ-i-ad^rad) — Ci + Cc, 



Ci = —RiSi and Cc = RcSc 



(55) 
(56) 
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are the contributions from the inflaton and the curvaton 
respectively. Observe that the axions do not contribute 
to the cross correlation (and the amplitude of the adia- 
batic perturbation). Also, note that the isocurvature per- 
turbation from the inflaton is fully anti-correlated with 
the corresponding adiabatic perturbation, whereas the 
isocurvature perturbation from the curvaton is fully cor- 
related with the adiabatic perturbation from it. So the 
overall correlation is mixed. It is thus useful to define ^ 
the dimensionless cross correlation cos A as a measure of 
the correlation between adiabatic and isocurvature per- 
turbations and the entropy-to-adiabatic ratio B at fcp: 



C S 

^^^'^^ RS ™^ ^ ^ R' 



(57) 



The total CMBR ang ular power spectrum is given, in 
e notation of Ref. [51| , by the superposition 

Ce^Cf + Q' + C^g'', (58) 



where 



ad 752;^ad,ni . Tj2^ad,n, 



0« — Jt, (_y , 



Rc^t. 



CIS QZf^'^i 



GrCe 



(59) 



(60) 



(61) 



The above relations hold for the TT, E-polarization (EE) 
and TE cross correlation angular power spectra. 

The TT power spectrum on large angular scales (i.e. 
for i < 20), can be approximated by (see e.g. Ref. 



TT 



27r2 

— [(R^+ASf-4a)fin.,i) 
+ {Rl+4S^,-4Cc)f{nc,e) 



where 



f{n,£) = ir,okpy- 



T{3-n)T{£-^ + ^) 



23-«r2(2 



f)r(€ + f 



(62) 



(63) 



with 770 = 2Hq ^ ~ 8.33 x 10^ Mpc being the value of 
conformal time in the present (matter dominated) uni- 
verse. The derivation of Eq. H62(l assumes that the uni- 
verse is completely matter dominated at the moment of 
recombination, and further neglects the late integrated 
Sachs- Wolfe (ISW) effect from the cosmological constant. 
Therefore this expression is accurate to about 10 — 20% 
and gives an order of magnitude estimate only. For scale- 
invariant spectra, f{n = 1,£) = l/TT£{i + 1) and the 
Sachs-Wolfe (SW) plateau is flat. Notice from Eq. ^ 
that a positively correlated perturbation (as the pertur- 
bation from the curvaton) displays less power on large 



angular scales, because the cross correlation term sub- 
tracts power and can partially cancel the isocurvature 
contribution. On the contrary, a negatively correlated 
perturbation (such as the one from the inflaton) presents 
larger power on the COBE scales and, therefore, can be 
more easily constrained. The COBE measurements give 
[ssj l the following central value for £ = 10: 



£{£+l)Cr/2nl^^^^l.05x 



10 



-10 



(64) 



We will apply this COBE constraint with Cj"^ evalu- 
ated from Eq. H62|) only as an indicative (approximate) 
criterion to get a first rough feeling on the possible com- 
patibility of our model with the data. 



VII. NUMERICAL CALCULATIONS AND 
RESULTS 

A. The cosmological evolution of <j!> 

We are now ready to proceed to the numerical study of 
the evolution of the PQ field during and after inflation. 
To this end, we fix the parameter k in the superpotential 
SWi in Eq. for standard hybrid inflation. Then, for 
any given value of the amplitude Ai of the partial curva- 
ture perturbation from the inflaton, we solve Eqs. H24|l . 
(|31|l and (|32() . where Xf is the solution oi rji = —1 with 
r]i given by Eq. and T^ch, which enters Eq. is 
taken equal to 10^ GeV by saturating the gravitino bound 
[43I I. We thus determine the mass parameter M, which is 
the magnitude of the VEV breaking the Glr symmetry. 
Subsequently, we find the (almost constant) infiationary 
Hubble parameter i/infi = nM^/^/Snip. The parameters 
M and HinR as functions of the amplitude Ai are shown 
in Figs, n and 121 respectively for k = 3 x 10~^ (solid line) 
or 3 X 10^^ (dashed line). Note that much smaller values 
of K would be considered unnatural. On the other hand, 
much bigger k's would push inflation to higher values 
of the inflaton field, where SUGRA corrections become 
important and may ruin 1^ infiation. 

For any given Ai , we chose a value 0/ of (/) at the end of 
infiation, which takes place at cosmic time tf = 2/3-ffinfi. 
We then solve the classical equation of motion for (p dur- 
ing inflation in the slow-roll approximation going back- 
wards in time {t < tf) by taking 7743/2 — 300 GeV (which 
is .59i] approximately its minimal value in the constrained 
MSSM with Yukawa unification [63), |^| = 5 and a fixed 
value for A (~ 10~^) in the PQ potential. Note that much 
bigger values of |^| (recall that |A| > 4) or much smaller 
values of A would not only be unnatural, but also would 
lead to an enhancement of the axion decay constant fa 
(see Eq. ^) and thus require unnaturally small misalign- 
ment angles 9 to achieve the observed total CDM abun- 
dance (see Eq. On the other hand, much bigger 
values of A would reduce fa leading to an unacceptably 
small CDM abundance. Finally, the parameter 7 during 
infiation is put equal to zero for simplicity (see Sec. 
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FIG. 1: The mass parameter M versus Ai for k = 3 x 10 
(solid line) or 3 x 10^^ (dashed line). 



FIG. 2: The inflationary Hubble parameter -ffinfl versus Ai 
for K = 3 X 10"^ (sohd line) or 3 x 10"^ (dashed line). 



We find that, as we move backwards in time, the value 
of increases and becomes infinite at a certain moment. 
Also, the number of e-foldings elapsed from this moment 
of time until the end of inflation is finite providing an 
upper bound iVmax on the number of e-foldings which is 
compatible with our boundary condition (f> = (j)f at tf. 

In order to get a (rough) understanding of this be- 
havior, we approximate the derivative of the potential V 
with respect to by y ~ 3A^0^/16m|, which holds for 
(j) —^ oo. The slow-roll equation for (j) during inflation can 
then be solved analytically and yields 



1 + , , y At 



(65) 



where At = t — tf < 0. It is obvious from this equation 
that, as At — s- At^iin = — 4TO|i?infl/A^0^, </) ^ oo. This 
implies that the maximal number of e-foldings which is 
allowed for a given value of 0/ is iVmax — — -ffinflAtmin = 
4TOpi7i^fj/A^(/)y. Needless to say that no approximation 
for V is used when we actually calculate TVmax- 

It is clear that we must first impose the requirement 
that iVmax > Nq. The time t* at which our present 
horizon scale crosses outside the inflationary horizon can 
then be determined from Nq — i/infi(t/ — <*). Further- 
more, we demand that, at i*, V" < Hf^^, which ensures 
that this inequality holds for all times between and 
tf. This condition, thus, guarantees that the PQ field is 
effectively massless during the relevant part of inflation 
and can act as curvaton. It also provides us with an a 
posteriori justiflcation of the validity of the slow-roll ap- 
proximation used and ensures that the velocity of <f) at 
the end of inflation is negligible. This masslessness re- 
quirement yields an upper bound on (pf for every given 
Ai and fixed values of k and A. The excluded region in 
the Ai — <j)f plane for fixed k and A is represented as a 



red/dark shaded area in the upper part of this plane. In 
Figs. island^ we show this upper red/dark shaded area 
for K = 3 X 10^3^ A = 10""* (model A) and k = 3 x 10"^, 
A — lO^'* (model B) respectively. The lower red/dark 
shaded area in the Ai — (f)f plane corresponds to the quan- 
tum regime and is also excluded for the reasons explained 
at the end of Sec. lIVK see Figs.^and^J. This also ensures 
that, at any given instant of time during inflation, (p has 
practically the same mean value in all comoving volumes 
(which crossed outside the horizon). So, during inflation, 
(j) can be simply considered as a function of time only. 

We start from any given value of 0/ at tf which is not 
excluded by the above considerations and assume van- 
ishing time derivative of <^ at tf. As explained above, 
this is a good approximation provided that 0/ does not 
belong to the upper red/dark shaded area in the Ai^ <j)f 
plane (see Figs. El and 0J. Moreover, we flnd numerically 
that the subsequent evolution of remains practically 
unchanged if we take a small non-vanishing value of (p 
at tf. Under these initial conditions at t/ , we follow the 
evolution of the field at subsequent times {t > tf) by 
solving the classical equation of motion in Eq. Hll|l with 
H = 2/(3t) for tf <t< treh and H = l/{2{t - trch/4)) 
for irch < t <tip. In the latter expression for H , we sub- 
tracted ti-ch/4 from t in order to achieve continuity of H 
at i = trch- The time of reheating t^ch is calculated by us- 
ing Eq. @ with = 228.75 and the curvaton decay time 
t0 is found by employing Eq. H14|l with /? from Eq. lO, 
where ^ is taken equal to 300 GeV, which is smaller than 
half the curvaton mass (sec Eq. H13|l ) so that the decay of 
(j) to Higgsinos is not blocked kinematically. The param- 
eter 7 in the effective PQ potential of Eq. (|1U|I is taken 
equal to 0.1 after the end of inflation (see Sec. Illljl . We 
flnd that, for flxed k and A, there exist two bands in the 
Ai — (j>f plane which lead to the desired PQ vacua at i^. 
They are depicted as an upper and a lower green/lightly 
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Ai [10-5] 

FIG. 3: The two green/lightly shaded bands in the Ai — (f>f 
plane which lead to the PQ vacua at t,f for k = 3 x 10~^, 
A — 10~* (model A). The white (not shaded) areas lead 
to the trivial vacuum and are thus excluded. The upper 
red/dark shaded area is excluded by the requirement that, 
at tf, V" < i^infl, while the lower one corresponds to the 
quantum regime. The blue/solid line shows the values of Ai, 
(j)f which approximately reproduce the correct value of the 
CMBR large scale temperature anisotropy, as measured by 
COBE (see Sec. lVIIOI for details). 

shaded band (see Figs. El and 0)). The white (not shaded) 
areas in the Ai~ (j)f plane lead to the false (trivial) min- 
imum at = and thus must be excluded. Note that, 
for all relevant k's and A's, the quantum regime over- 
laps (at most) with the lower green/lightly shaded band 
only. However, as we will see later, this band is anyway 
excluded by the data in all cases, the reason being that 
it predicts an unacceptably large isocurvature perturba- 
tion. So, the a priori exclusion of the quantum regime 
does not affect our results in any essential way. 

Besides models A and B (we summarize the corre- 
sponding parameter values in TablcQJ, which will be used 
as our main examples in this presentation, we have also 
studied three extra pairs of values of k and A, namely 
K = 10-3 and A = lO"'^, k = lO'^ and A = lO"'^, and 
K = 3 X 10~^ and A = 3 x 10~* (see below). In the first 
two cases, the behavior was found to be quite close to 
the behavior of model A as depicted in Fig. |31 while the 
latter case behaves similarly to the model B (see Fig.QJ. 



B. The calculation of Ce 

For any fixed pair of values for n and A, we take a 
grid of values of Ai and 0/ which span the correspond- 
ing upper or lower green/lightly shaded band. For each 
point on this grid, we consider (j)f and its perturbed value, 
which is found by adding to 0/ the amplitude of 5(j)f 
from Eq. (|37|) . We then follow the subsequent evolution 
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Ai [10-5] 

FIG. 4: The two green/lightly shaded bands in the Ai — <j)f 
plane which lead to the PQ vacua at for k = 3 x 10~^, 
A = 10"* (model B). The notation is the same as in Fig.O 
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Model A 




Model B 


K 




3 X IQ-^ 




3 X 10-2 


A 






10-'' 




"T-3/2 






300 GeV 




1^1 


m 




5 










300 GeV 




7 




(0.1) during (after) 


inflation 


^rch 


m 




10^ GeV 










200 GeV 





TABLE I: Summary of the fixed model parameters for the 
two representative cases presented in the text, called model 
A and B. The equation where each parameter first appears is 
also indicated. 



of both these fields until the time of the curvaton de- 
cay, where we evaluate the amplitude of 5p^/p(j,. The 
amplitude A^ of the partial curvature perturbation from 
the curvaton is then given by Eq. 

We thus obtain a mapping of the values of 0/ which are 
allowed for any given value of Ai onto the corresponding 
values of the amplitude A^ of the partial curvature per- 
turbation from the curvaton, which is the relevant vari- 
able for the calculation of the CMBR power spectrum. 
For the MC analysis (see Sec. IVIIPjl . we therefore use 
Ai and Ac as base parameters describing the initial con- 
ditions and limit our grid to values of Ai smaller than 
6 X 10^'^ (since larger values would overpredict the tem- 
perature power of the SW plateau). The spectral indices 
rii and for each point of this grid are found by using 
Eqs. (EZI) and apphed at a; = xq, and Eqs. ((201), 
and (|39|l applied at i*. The fraction / in Eq. H22|) is also 
evaluated at i^. 
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The amphtude Aa of the isocurvature perturbation in 
the axions is calculated from Eq. H44|l with the initial 
misalignment angle 9 evaluated, for any given value of 
the total CDM abundance ficDM^^ = ^Lsph^ + ^ah^, 
by using Eqs. (HU and (gH) with mLSp = 200 GeV. This 
value of TTT-LSP corresponds |59l| , in the constrained MSSM 
with Yukawa unification 'GO'], roughly to the value of 
(= 300 GeV) chosen here. Note that, for our choice of 
parameters, the LSP relic abundance is fixed (~ 0.0074). 
The spectral index for axions in each point of the grid is 
found by using Eq. l|T7jl . 

In summary, for any fixed pair of values for k and A, 
we take a grid in the variables Ai and ^/ covering the up- 
per or lower green/lightly shaded band and numerically 
map (j)f into the corresponding value of the amplitude 
Ac (which, of course, depends on Ai too). For any value 
of rish^ and fla.h^, we then calculate the amplitudes 
squared of the adiabatic and isocurvature perturbations 
using Eqs. ^SU^, (|STJ and Eqs. ((23, (ES respectively, the 
cross correlation amplitude from Eqs. (|55|l . (|56|) and the 
total CMBR temperature and polarization power spectra 
via Eqs. 1)58(1 — 1)61 11 . The curvaton fractional contribution 
to the adiabatic amplitude F^'^, the dimensionless cross 
correlation cos A and the entropy-to-adiabatic ratio B 
are found from Eqs. (|5^ and l(57|l . For the MC analysis 
(see Sec. lVTTDII . we need to be able to sample the Ai — Ac 
space in any point as the chains evolve. Therefore, we 
perform a 2-dimensional (2D) interpolation between the 
points on the grid using a bicubic spline. A little care 
is needed regarding the present value of the Hubble pa- 
rameter, which enters the computation of the mass scale 
M and thus Hi^a for given amplitude Ai via its impact 
on Nq (note that the first term in the right hand side of 
Eq. H32|) depends on Hq). As a consequence, the values of 
the amplitudes Ac and Aa will also depend on the value of 
Hq. However, we have found numerically that changing 
the value of Hq around Hq = 72 km sec~^ Mpc""'^ within 
the HST la margin (i.e. by ±8 km sec~^ Mpc^^) has an 
impact less than about 3% on the computed quantities. 
Therefore, in the computation of M and of the ampli- 
tudes Ac and Aa, we fix the present Hubble parameter 
to the HST central value Hq = 72 km sec"! Mpc"^ [s^. 
Clearly, we do allow Hq to vary in the MC analysis (see 
below). In particular, the dependence on Hq of the am- 
plitudes Ap^i, Ap^c and Ap ^ at the pivot scale fcp (see 
Eq. is taken into account. 



C. A toy model for the CMBR 

Before deploying the full MC machinery to derive 
quantitative constraints on the various parameters of our 
model, it is instructive to consider a toy model for the 
pre-WMAP CMBR data which allows us to understand 
the salient qualitative features of the parameter space of 
our initial conditions. This approach has the great advan- 
tage of offering a much more transparent understanding 
of the parameter constraints beyond the black box of the 



numerical MC study. 

A first rough idea about the viability of our model 
can be obtained by using the approximate expressions 
in Eqs. (|62|l and H63I) for the temperature SW plateau 
and requiring that the COBE constraint in Eq. I|64() 
is fulfilled. To this end, we take f^s/i^ = 0.0224 and 
ncBuh'^ = 0.1126, which are the best-fit values from the 
WMAP measurements _2j. (Note that, for this choice of 
parameters, the axions constitute about 93% of the CDM 
in the universe.) We find that the COBE constraint can- 
not be satisfied in the lower green/lightly shaded band 
in the Ai — (j)f plane for any pair of values for k and A. 
The reason is that the relatively low values of (/)/ in this 
band imply small values for 0* too. The amplitude Aa 
then turns out to be quite large as can easily be seen 
from Eq. H44|) . This fact combined with the sizable relic 
abundance of the axions leads to large values of Sa (see 
Eq. I|54(l ) . which yield an unacceptably large contribu- 
tion to the right hand side of Eq. 1)62(1 . In the upper 
green/hghtly shaded band in the Ai — (j>f plane, on the 
contrary, the COBE constraint is generally easily satis- 
fied. The resulting solution is depicted by a blue/solid 
line (see Figs. |21 and 

We are further interested in deriving ball-park esti- 
mates for the values of Ai, Ac- To this goal, we again 
approximate the SW temperature plateau by the expres- 
sion in Eq. ((62)1 . As mentioned, this formula neglects the 
effect of the cosmological constant A (late ISW term), 
but our present purpose is to build an extremely simpli- 
fied toy model, not to include all contributions which are 
fully taken into account in the MC analysis below. There- 
fore, we take one single datum, namely the COBE con- 
straint in Eq. ()64)l . which describes the height of the SW 
plateau, along with its variance i{£+l)ACj'^ /2Tr\^^^Q w 
0.2 X 10~^° [s^. Furthermore, we drop altogether the 
dependence of Eq. 162)) on the spectral indices by taking 
all perturbations to be scale invariant, since the value of 
the indices predicted by our model is anyway very close 
to unity (see Sec. IVIID 2p . We need a second datum 
to be able to constrain two free parameters {Ai and Ac) 
and this is given by the height of the first adiabatic peak 
as measured by BOOMERANG 61] . In the analysis of 
Ref. m, this yields eii + l)Cj'^ /2t:\^^^^^ w 5.4 x 10-1° 
with error £{£ + l)ACj'^ /2Tr\^^^^^ « 0.54 x lO^i". We 
model the theoretical CMBR spectrum at the level of the 
first adiabatic peak by retaining the adiabatic contribu- 
tion only, since the isocurvature temperature spectrum 
drops very fast beyond the SW plateau. Thus the pre- 
diction of our toy PQ model for the height of the first 
peak is given by 

Cl^V" - ^ + Rl) > (66) 

where the constant factor AH « 6.5 approximates the 
ratio of the first peak height to the SW plateau for the 
adiabatic temperature spectrum of the standard ACDM 
model. Once again, this expression is very crude and does 
not account for changes in h, Q\ (= pa/ Pc with p\ being 
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FIG. 5: Posterior marginalized pdf (normalized at peak value) 
for the amplitudes Ai, Ac from our toy model for two choices 
of priors: flat priors (top panels), and Jeffreys' priors (bottom 
panels). The black line is for model A (upper green/lightly 
shaded band in Fig.|HJ and the cyan/light gray line for model 
B (upper green/lightly shaded band in Fig. The con- 
straints on Ai, Ac in model B and on Ai in model A as well 
as the upper limit on Ac in model A are robust with respect 
to the choice of priors. Compare the top panels with Fig. |S| 
which shows the results of the full MC analysis (with flat 
non-informative priors) . 



the dark energy density), fim, flB or (optical depth 
to the reionization epoch), all of which affect the relative 
height of the peak to the plateau, but it is sufhcient for 
our present goal. In summary, the likelihood function 
L{Ai, Ac) of our toy model is given by 



-2lnL{A,,Ac) 



(67) 



a 



TT,PQ 
212 



^212 



TT 



AC. 



where C^^'^'^ can be found from Eq. 162() . The posterior 
(see Appendix for definitions) is then 

ViA,Ac) oc LiA,,Ac)niAi)7:{Ac). (68) 

We adopt non-informative flat priors on the amplitudes 
Ai,Ac, so that Tr{Ai) = 7r(Ac) = constant. An al- 
ternative choice is Jeffreys' prior, which is of the form 
7r{Ai) = l/Ai, tt{Ac) = 1/Ac corresponding to a flat 
prior on log Ai, log^c- This prior implies that we do not 
have any idea on the scale of Ai, Ac before seeing the 
data and thus represents a quite extreme choice of prior. 

In Fig. |S1 we present the posterior marginalized prob- 
ability distribution functions (pdf's) for Ai, Ac from our 
toy model for the two choices of priors above. We ob- 
serve that there is a qualitative difference between the 
results for the upper green/lightly shaded band in Fig.|2| 
and ^ of model A and B. For model B, both the inflaton 
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FIG. 6: 1-dimensional (ID) marginalized posterior distribu- 
tion for the two cases considered here: model A (black solid 
line) and model B (cyan/light gray solid line) with only the 
upper green/lightly shaded bands of Figs. |3 and |1| included. 
The red/medium gray line is for the standard pure inflaton 
single-field inflationary case with Harrison-Zel'dovich (HZ) 
spectrum {rii = 1.00 fixed), plotted here for comparison. 
We plot non-smoothed histograms corresponding to top-hat 
biimings to show the resolution of our curves. Model A is 
quite close to the pure inflaton case, except that the curva- 
ton contribution helps in reducing the reionization redshift Zr- 
Model B displays a preference for non-zero curvaton contri- 
bution, a still lower Zr and larger relic abundance of baryons 
{lob ~ i^sh^) and CDM (wcdm = SIcdm^^)- Its quality of 
fit is however poorer (see Sec. EUni for details). We also 
display as dotted smoothed curves the values of the mean 
posterior. All the curves are normalized at their peak value. 



and the curvaton amplitude are well determined even in 
our over-simplified toy model. From the plot, we read off 
that Ai w 2.3 X 10~^, Ac « 70 x 10~^. As a consequence, 
the choice of prior hardly matters, as we would expect in 
a situation where the posterior is dominated by the like- 
lihood. As for model A, we obtain that Ai « 4.9 x 10~^, 
but we can only place upper limits on the value of Ac- 
Since, in this case, Ac is not a well-determined param- 
eter, the details of its posterior pdf do depend on the 
prior. Nevertheless, it is apparent that the upper limit 
is robust, and we can deduce that Ai<50 x 10~^. These 
numbers have to be taken only as ball-park estimates, 
and this is why we do not bother to attach errors to 
them. However, the comparison of Fig. [S] with Fig. El 
which shows the results of the quantitative MC analysis 
including WMAP and other recent CMBR data and all 
the other cosmological parameters, displays an astonish- 
ing agreement between the results of the above toy model 
and of the full MC analysis. The rather precise match 
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of the two results is actually the fortuitous outcome of 
the cancellation between two opposite effects. On the 
one hand, the MC analysis compared to the toy model 
employs much more precise data, which reduce the error 
on the constraints, but, on the other hand, it also in- 
tegrates over the cosmological parameter space (totally 
ignored in the toy model), which enlarges the error on 
the marginalized quantities. 

From the study of the above toy model, we can thus 
draw two conclusions. First, the height of the first adia- 
batic peak to the large scale SW plateau is the key quan- 
tity to constraining - with at least order of magnitude 
accuracy - the curvaton to inflaton contribution in the 
PQ model. Clearly, quantitatively reliable bounds need 
the inclusion of the detailed effect of the cosmological 
parameters on the shape of the power spectra (see next 
section). Second, we have seen that the constraints on 
the inflaton and curvaton amplitudes are robust with re- 
spect to two different choices of (non-informative) priors. 
For this reason, we will, from now on, adopt a flat prior 
on Ai, Ac-, which will be used both for the extraction 
of the constraints on the parameters (see Sec. IVII D 2\i 
and for the purpose of Bayesian model comparison (see 
Sec. lVllU3ll . 



D. The Monte Carlo CMBR analysis 

1. The setup 

We now proceed to describe the setup of the full nu- 
merical analysis confronting the predictions of our PQ 
model with the CMBR data. We constrain the rele- 
vant parameters of our model by constructing Markov 
chains using a modified version of the publicly available 
Markov chain MC package COSMOMC (6^] as described in 
Ref. [iil. As discussed in Sec. |Vl| the total CMBR an- 
gular power spectrum is given by a (anti-)correlated su- 
perposition of adiabatic and isocurvature CMBR modes 
(see Eq. The adiabatic and isocurvature CMBR 

transfer functions are computed in two successive calls 
similarly to the technique employed in Ref. [65l |. For 
fixed values of n and A, the initial conditions are com- 
pletely specified by Ai and or equivalently by the 
values of Ai and Ac as explained in Sec. IVII Bl The 
MC sampling takes as free parameters the amplitudes Ai 
and Ac, the physical baryon and axion densities in the 
present universe ujb = ^sh^ and uja = ^ah^ in units of 
1.88 X 10~^^ g/cm'^, the present value of the dimensionless 
Hubble parameter h = _ffo/100 km sec~^ Mpc^^, and 
the redshift Zr at which the reionization fraction is a 
half (assuming sudden reionization). All other derived 
quantities are computed from the above parameter set, 
as detailed in Sec. IVII Bl In particular, the derived pa- 
rameters Rl, Fc'^, Sf, Sc, Si, Ci, Cc are obtained 
from Eqs. (EH), El and 

For our choice of ttilsp and Trch, we have Qi^gph^ w 
0.0074 = constant, and the total CDM abundance is 



LucBM = ^CBMh'^ = f^LSp'i^+^a^^- Our analysis Consid- 
ers fiat cosmologies only, thus the cosmological constant 
energy density Qa (in units of the critical energy density) 
is a derived parameter, i.e. ^Ia = 1 — (wcdm + ^B)/h'^- 
We assume three massless neutrino families and no mas- 
sive neutrinos (for constraints on these quantities, see 
e.g. Ref. [i^). We neglect the contribution of gravita- 
tional waves to the spectrum, since the tensor to scalar 
amplitude ratio at the SW plateau is proportional to e^, 
which is completely negligible in our case. In summary, 
for a fixed choice of k and A, our parameter space is six 
dimensional: 

e^{ujB,UJa,h,Zr,A^,Ac}. (69) 

We compare the predicted CMBR temperature and po- 
larization power spectra to the WMAP first-year data 
(temperature and polarization) with the routine for com- 
puting the likelihood supplied by the WMAP team [6^ . 
At a smaller angular scale, we add the CBI 's^l and the 
decorrelated ACBAR [3^ |3lj band powers as well. We 
then run = 20 Markov chains starting from randomly 
chosen points in the parameter space. We take particu- 
lar care in ensuring that the starting points are spread 
over a wide range in the Ai — Ac plane. We then check 
that all chains have converged to the same region of pa- 
rameter space. This indicates that this region is indeed 
a global minimum (at least for the range explored by the 
chains). This is the main reason for using a relatively 
large number of chains, since the danger that the chains 
are stuck in a local minimum is great when exploring 
mixed isocurvature initial conditions (see e.g. Ref. iSiJ)- 
A preliminary run is needed to estimate the covariance 
matrix, which is then diagonalized and used to perform a 
final run until each chain contains M = 30, 000 samples. 
The mixing of the chains is checked using the Gelman 
and Rubin criterion £81] , for which we require that the 
ratio of the variance of the mean to the mean of the vari- 
ance among the chains is i? < 0.1 for all parameters. The 
parameter inference is performed on the merged chains, 
which contain around 5 x 10^ samples after the burn-in 
has been discarded. 

As motivated above, we use flat top-hat priors on the 
base parameters 

UJB,UJa,Zr,Ai,Ac. (70) 

The limits of the top-hat prior do not matter for param- 
eter estimation purposes, as long as we check that the 
posterior density is negligible near these limits. However, 
the prior range of the accessible parameter space plays an 
important role in computing the Bayes factor for model 
testing (see Sec. IVII D 3|l . Wc limit the maximum range 
of the present dimensionless Hubble parameter h by im- 
posing a top- hat prior 0.40 < /i < 1.00 and we use the 
result of the HST measurement [5^ 

LHST^exp(^y, (71) 



16 



JJdi drill 


model A (upper band) 


model B (upper band) 




best fit 


ID 68% c.i. 


best fit 


ID 68% c.i. 




0.024 


0.024 ± 0.002 


0.026 
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5.1 ±0.5 


3.1 


3.1 ±0.4 


A, [10-5] 


20.3 


< 37.9 (43.2) 


62.0 


62.8 ± 10.4 




721.2 


732.6 



TABLE II: Best-fit parameter values and sample means with 
la confidence intervals (c.i.) for the ID marginalized distri- 
bution for the upper band of models A and B. We indicate 
upper limits only when the parameter is not constrained from 
below, in which case the first number corresponds to the 68% 
c.i. (1 tail) and the number in parenthesis to the 95% c.i. of 
the parameter. The present Hubble parameter Hq is given in 
km sec"^ Mpc"^. Finally, L* is the best-fit likelihood (see 
Appendix). For comparison, the Harrison-Zel'dovich (HZ) 
pure inflaton model has — InL* — 721.7. 



where ho — 0.72 and cr = 0.08, by multiplying the likeli- 
hood function for the CMBR data by the above Gaussian 
likelihood. 

Parameter constraints will be rather insensitive to the 
details of the prior distribution whenever the posterior 
is dominated by the likelihood. As we have seen from 
our toy model, the broad lines of the constraints for the 
PQ model are indeed robust with respect to the choice of 
non-informative priors. We will see below that the priors 
do play an important role in Bayesian model comparison. 

2. Parameter constraints 

In the Appendix, we summarize some concepts and re- 
sults from Bayesian statistics which will be useful in the 
following analysis (see e.g. Refs. |6S] and 0| for reviews 
and details). We first consider the upper green/lightly 
shaded band of models A and B. In Tables and IIIII 
we present the best-fit values and parameter constraints 
obtained from the MC chains for our base and derived pa- 
rameters respectively. The 1-dimensional (ID) marginal- 
ized posterior distributions for the base and most of the 
derived parameters are plotted, respectively, in Figs. 
and|3 while the 2D contours of the posterior for Ai, Ac 
and the adiabatic amplitudes squared R^, are dis- 
played in Fig. |H| 

In the upper (green/lightly shaded) band of model 
A, the adiabatic amplitude from the inflaton dominates 
with only a modest contribution (~ 10% in amplitude 
squared) to the adiabatic amplitude from the curvaton 
and an even smaller curvaton isocurvature amplitude and 
cross correlator. Note that the axion isocurvature ampli- 
tude is negligible in this case. Due to the small value 
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TABLE III: As in Table HTl but for the derived parameters. 
We do not give c.i. for the spectral indices since the variation 
in their value is less than IQ-^. 



of the parameter /, the inflaton part of the isocurva- 
ture amplitude squared (correlator) is suppressed rela- 
tive to the corresponding part of the adiabatic amplitude 
squared by more than two orders (one order) of magni- 
tude. As a consequence, the power spectra are domi- 
nated by the adiabatic part of the inflaton contribution. 
Moreover, on large scales, we find that the sum of the 
adiabatic, isocurvature and correlation parts of the total 
power spectrum coming from the curvaton almost cancels 
out, since the contribution from the curvaton correlator 
is negative. We can easily verify the behavior just de- 
scribed with the help of Fig.|51 where we plot the best-fit 
power spectra for model A (upper band) divided into 
the inflaton, curvaton and axion contributions to their 
adiabatic, isocurvature and correlator parts. The qual- 
ity of the fit, as expressed by the maximum likelihood 
value — InL* = 721.2, is slightly better than for the 
pure inflaton Harrison-Zel'dovich (HZ) case, which has 
— InL, — 721.7 (see, however, our remarks below regard- 
ing model comparison). This is not surprising since the 
curvaton contribution turns out to play a modest role in 
this model. In particular, the CMBR data put an upper 
bound on the allowed value of the curvaton amplitude, 
which is Ac < 43.2 x 10"^ at 95% confidence level (c.L). 
The total temperature power on large scales is slightly 
larger than the pure adiabatic part - the net effect is to 
increase the height of the SW plateau compared to the 
height of the first adiabatic peak. This mimics the im- 
pact of a larger optical depth (and thus of a larger z^), 
and explains why model A shows a preference for a later 
reionization than in the pure inflaton case. The TE spec- 
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FIG. 7: As in Fig. |S| but for (most of) the derived parame- 
ters (the HZ pure inflaton model is not included here). The 
adiabatic amplitude in model B is dominated by the curvaton 
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FIG. 8: Contours containing 68% and 95% of the probability 
for the joint posterior pdf for the upper band of models A and 
B (same notation as in Fig. |SJ . The base primordial param- 
eters Ai, Ac are displayed in the left panel, while the right 
panel shows the derived adiabatic amplitudes squared from 
the inflaton (Rf) and the curvaton (Re)- Model B prefers a 
non-vanishing curvaton contribution to the adiabatic ampli- 
tude, but its quality of fit is poorer (see text for details). 



trum is dominated by the inflaton adiabatic part, but on 
large scales the curvaton and axion contributions give a 
net power increase. This effect helps in better fitting the 
"reionization bump" (i.e. the power increase for £<10 
due to reionization) at low multipoles reducing the need 
for a rather early reionization. 

The upper (green/lightly shaded) band of model B ex- 
hibits a preference for a non-vanishing curvaton ampli- 
tude with very high significance {Ac — 62.8 ± 10.4) as 
one can also see from Fig. |S1 In this case, the best-fit 
spectrum is a superposition of a dominant curvaton adi- 
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FIG. 9: Best-fit TT (upper panel) and TE (lower panel) power 
spectra for the upper band of model A (parameters as in Ta- 
ble [HJ. The line codes are: red/dark gray for the infiaton, 
green/medium gray for the curvaton, and yellow/light gray 
for the ajdon contributions; dotted for the adiabatic, long- 
dashed for the isocurvature, and short-dashed for the corre- 
lator parts. The total power corresponds to the blue/solid 
line. In the upper panel, the correlator parts are given in 
absolute value. Note though that the infiaton correlator con- 
tribution has to be added (negative correlation), while the 
curvaton one has to be subtracted (positive correlation) to 
obtain the total power. 



abatic part and an inflaton adiabatic contribution which 
is around 50% of the former in amplitude squared (see 
Table Ulll and Fig. I10|) . This time, the isocurvature cur- 
vaton part is sizable on large scales, where however it 
is again cancelled by the correlator. The large value of 
K yields a larger H^, (see Fig. ^ and thus pushes up 
the axion isocurvature contribution (see Eq. H44|) ). which 
again adds power to the SW plateau. As in model A 
(upper band), the isocurvature and correlator inflaton 
parts are negligibly small. For the best-fit parameters, 
the total power in temperature in the COBE region is 
in better agreement with the data, being slightly smaller 
than for model A (upper band), thereby fitting better 
the low-multipole region of the spectrum. However, the 
overall quality of the fit is worse (— InL* — 732.6) be- 
cause the model does not reproduce with enough preci- 
sion the shape of the first acoustic peak. We illustrate 
this in Fig. 1111 where we compare the two best-fit spec- 
tra for models A and B (upper bands). The WMAP 
data around the first temperature peak are of such a 
good quality that they can discriminate between the two 
models thanks to the slightly different shape of the peak. 
The reason why the curvaton/inflaton mixture does not 
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B. The base parameters are according to Table ^ and the 
notation is as in Fig.|U] 



fit accurately enough the first temperature peak in model 
B is twofold. First, the isocurvature and correlator con- 
tributions are still sizable in the region of the first peak 
rise {£ ~ 100), and this increases slightly the total power 
in this part of the spectrum. Second, the curvaton and 
inflaton have two slightly different spectral indices (see 
Table llll)l . and the resulting tilts of the spectra are there- 
fore mismatched. As for the TE spectrum, the isocur- 
vature axion part plays an important role in reproduc- 
ing the reionization bump. Furthermore, model B (up- 
per band) shows a preference for a rather high baryon 
abundance {lob ~ 0.026), which is in strong tension with 
the value indicated by BBN together with observations 
of the light elements abundance, which typically yields 
LJB ~ 0.020 ± 0.002 [73. 

Fig. ^] shows the ID marginalized constraints on 
the dimensionless correlator cos A and the entropy-to- 
adiabatic ratio B, while Fig. 1131 gives the 2D joint con- 
straints for these two parameters. In model A (upper 
band), the dominance of the inflaton and the low ampli- 
tudes of the correlator modes result in a value of the effec- 
tive correlation cos A roughly centered around zero. As 
discussed above, the entropy contributions play a mod- 
est role in model A (upper band), and correspondingly 
we obtain the upper limit i?<0.5 (at 95% c.L, 1 tail). 
In model B (upper band), the large amplitude of the 
curvaton correlator determines a positive total correla- 
tion cos A « 0.5. Together, the curvaton and axion 
isocurvature amplitudes constitute a significant fraction 
of the total amplitude, so that the entropy-to-adiabatic 
ratio is much larger than zero (i? ~ 0.75). In summary, 
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FIG. 11: Cumulative diflerence between model B and model 
A (upper bands) of the quantity = — InL, for the 

WMAP TT (green/dotted hne in the top panel) and TE data 
(blue/dashed line in the bottom panel) in units given on the 
left vertical axes. Their sum is given in the top panel by 
the thick black solid line. In the top panel, we superimpose 
the two corresponding best-fit TT spectra for model A (thin 
black line) and model B (thin cyan/light gray line) with units 
on the right vertical axis. In the bottom panel, we superim- 
pose the two TE spectra (same notation). To compute the 
difference as a function of the multipole £, we use only 
the diagonal elements of the data covariance matrix (for the 
MC analysis, we, of course, included also the off-diagonal el- 
ements, which however contribute only a few percent). We 
also plot the binned WMAP TT (top panel) and TE (bottom 
panel) errorbars, as a guide to the eye to appreciate the dis- 
criminative power of the WMAP data, especially around the 
first acoustic peak in the temperature spectrum. Model B is 
a better fit to the TT SW plateau, since its power there is 
smaller, but its TE spectrum in this region fits the WMAP 
data worse. Model B cannot reproduce the overall shape of 
the first acoustic peak in temperature with enough accuracy, 
and its goodness-of-fit is correspondingly worse. 



model B (upper band) shows a complex superposition of 
modes with a subtly balanced contribution of adiabatic 
and isocurvature components. 

We have also performed a MC analysis for the lower 
green/lightly shaded band in Fig. of model A. As ex- 
pected from the arguments given above, the quality of the 
best fit is poor (— InL* = 793.4) as one can see from Ta- 
ble ^3 a-nd Fig.^1 because the small value of 0* for this 
band gives rise to a large axion contribution according 
to Eqs. 144|l . H49|l and H54|l. Furthermore, the curvaton 
amplitude Ac turns out to be much larger than the am- 
plitude Ai from the inflaton. This can be understood by 
observing that, in contrast to Ai, the amplitude Ac from 
the curvaton can be large and still give a small positive 
or even a negative contribution to the total power in the 
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FIG. 12: As in Fig. El but for the dimensionless correlator 
cos A and the entropy-to-adiabatic ratio B evaluated at the 
pivot scale kp (see Eq. 1571 '). (The HZ pure inflaton model 
is not included.) The sharp drop for _B<0.11 encountered in 
model A is a numerical feature due to the lower boundaries 
of our MC run. The curve must accordingly be interpreted as 
indicating an upper limit only, which is given in Table ITTTl 
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FIG. 13: Contours containing 68% and 95% of the probability 
for the joint posterior pdf for the upper band of models A and 
B for the entropy-to-adiabatic ratio B and the dimensionless 
correlator cos A (same notation as in Fig. (HJ. 



SW plateau because the curvaton correlation is positive 
and thus subtracts power from large scales. This is im- 
portant since, in this case, the axion contribution to the 
plateau is very large leaving little room for other contri- 
butions, while, at smaller scales, the axion isocurvature 
spectrum becomes negligible and the total temperature 
spectrum must necessarily be dominated by either the 
curvaton or the inflaton adiabatic contribution. So one 
of the two amplitudes Ai or Ac has to be large enough 
and, as we saw, this can happen only for Ac- There- 
fore, at smaller scales, the spectrum is dominated by the 
curvaton adiabatic part. The best-fit inflaton amplitude 
Ai, on the other hand, corresponds to the lower limit 
of our parameter space as one can see from Table IIVI 
The reason is that the inflaton contribution to the power 
spectra is always positive with an unsuppressed (by fac- 
tors of / or (r^LSP + ^B)/^m) adiabatic part. Thus the 
inflaton amplitude Ai must be very small in order not 
to overpredict the large-scale power. Furthermore, the 
TE spectrum has a very pronounced reionization bump, 
which results from a rather early reionization epoch and 
the large axion isocurvature contribution. 

The lower band of model B is totally incapable of pro- 
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TABLE IV: Best-fit parameter values for the lower band of 
models A and B. An asterisk indicates that the corresponding 
parameter has reached the limit of our top-hat prior in the 
MC run, Ho is again in km sec-^ Mpc~^ and L, is the best-fit 
likelihood. Note that, due to the large value of flA = 0.91 and 
the small value of Qm in the best fit of model B (lower band), 
the approximation used in deriving Eq. 1621 is no longer valid. 



ducing a spectrum in reasonable agreement with the data 
(see Table Uvjl . For values of Ai , Ac corresponding to this 
band, the axion contribution is huge and gives tempera- 
ture fluctuations at large angular scales which are a few 
orders of magnitude larger than what is observed (see 
Fig. 113). 

Regarding the issue of non-Gaussianity, we note that 
our parameter /, defined in Eq. (f^ . is » 10^^. There- 
fore, non-Gaussianity from the curvaton partial curva- 
ture perturbation is well within the current bounds from 
WMAP lli (see Ref. jl^). Actually, the values of / 
in our model are high enough to ensure that this state- 
ment remains true even in the limit of pure curvaton. 
Moreover, the non-Gaussianity of the isocurvature per- 
turbation in the axions is also negligible. This is because 
the perturbation S9 = 77*/27r0* acquired during inflation 
by the initial misalignment angle 9 (see e.g. Ref. is 
always much smaller than 9. As a consequence, terms 
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FIG. 14: Best-fit power spectra for tfie lower band of model 
A. The base parameters are according to Table ITVI and the 
notation is as in Fig.|U] 



FIG. 15: Best-fit power spectra for the lower band of model 
B. The base parameters are according to Table II VI and the 
notation is as in Fig.|U] 



which are second order in this perturbation can be safely 
neglected (see Ref. (Tsj ) . Finally, the non-Gaussian com- 
ponent from the inflaton is also neghgibly small. 

We have performed two other MC runs with the same 
value for A = 10~^ as in model A and k shghtly larger or 
smaller, i.e. k = 10~^ or k = 10~^, recovering a behavior 
which is qualitatively similar to the behavior of model 
A. We have thus chosen to present our results for this 
particular value of k (= 3 x 10~^) as representative of 
the behavior of this class of models. We have also tried a 
slightly larger value of A (= 3 x 10"'') for the same value 
of K = 3 X 10^^ as in model B, and found a behavior 
qualitatively similar to model B. 



3. Bayesian model comparison 

So far we have been concerned with the problem of 
deriving parameter constraints from the data, given an 
underlying model for the generation of the primordial 
fluctuations. Models A and B (upper bands) actually 
both belong to a continuous class of models (belonging 
to our PQ model) which is parameterized by k and A. 
However, since in this work we have chosen to fix the 
values of k and A, we may as well consider models A and 
B (upper bands) as two discrete disconnected models. 
The question is then to compare these two models with 
the standard pure inflaton HZ model and decide which 
of these three models is most favored by data. It should 
be stressed that model comparison (or model testing) is 
a different issue than parameter extraction, and indeed it 



represents a further step in Bayesian inference. In fact, it 
can very well be that model testing arrives at a different 
conclusion than parameter estimation. Indeed, it can be 
that the estimated value of a parameter under a model 
All is far from the null value predicted by model M.2i but 
All is disfavored against A^2 by Bayesian model testing, 
a fact sometimes called "Bartlett's paradox" 74]. This 
is exactly the case for Ac in model B (upper band) com- 
pared to the pure inflaton model with flat spectrum of 
perturbations, as we will show below. 

Sampling statistics (i.e. the frequentist approach to 
parameter estimation) uses the notion of the goodness- 
of-fit test to assess the viability of a model without the 
need of specifying an alternative hypothesis. This usu- 
ally reduces to the statistics for the observed data, 
presuming the model under consideration, M, is true. 
M is then rejected if the value of the goodness-of-fit falls 
above a certain threshold in the tail of the distribution. 
If we take this criterion at face value and use the 
statistics for the WMAP data, the best-fit pure inflaton 
ACDM model with n, ^ 1 having = 1431 for u = 1342 
degrees of freedom (see last paper in Rcf. 2J) would be 
rejected with a type I error (i.e. probability of falsely re- 
jecting a true model) of about 5%. Notice that this does 
not mean, as sometimes stated, that "the probability of 
the model is 5%" . We will see below that Bayesian model 
comparison is more informative and can give useful guid- 
ance for model building. 

It is clear that models with a very poor best fit can be 
discarded simply by inspection. An extreme example is 
certainly the lower band of model B presented above. 
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FIG. 16; Illustration of the Laplace approximation to the 
(non-normalized) posterior used to compute the model evi- 
dence. For our base parameters, we plot the logarithm of 
the ID marginalized posterior from the MC samples (crosses 
corresponding to the histograms in Fig. along with the 
corresponding Laplace approximation (solid smooth lines) of 
Eq. (EU for the upper bands of model A (black) and model 
B (cyan/light gray). Clearly, the Laplace approximation is 
very good for model B, and of acceptable quality for model 
A. All the curves are normalized to zero at their peak value. 
In particular, Ac is a rather non-Gaussian direction for model 
A (upper band), since the posterior from the MC samples 
only gives an upper limit on this parameter. 



The goodness-of-fit for the lower band of model A is 
also very poor, even though this is not readily distin- 
guishable from Fig. ^1 due to the logarithmic scale. In 
fact, the best-fit point has — InL* = 793.4, compared to 
— InL* = 721.2 for the upper band of this model. Again, 
we can dismiss the lower band without further analy- 
sis. However, we need a more powerful tool if we are to 
decide, on the basis of the available data, between the 
viability of our PQ model as opposed to the pure infla- 
ton HZ model. Bayesian inference offers a natural tool 
for model comparison in the form of the evidence in fa- 
vor of the model (see Appendix for details and precise 
definitions). 

We thus compute the Bayes factor for models A and 
B (upper bands), comparing each of them to the scale- 
invariant (i.e. Jii = 1.00) pure inflaton model, which we 
call in the following the HZ inflation model. We approxi- 
mate the integrals involved in the evaluation of the Bayes 
factor by using the Laplace approximation (see Eq. HA9|l). 
We must first check that a multi-dimensional Gaussian 
is a reasonable approximation to the actual shape of our 
(non-normalized) posterior pdf. This is shown in Fig. 1161 
where we plot the logarithm of the ID marginalized non- 



normalized posterior from the MC samples along with the 
corresponding Laplace approximation. For model B (up- 
per band), the Gaussian approximation is quite accurate 
along all directions, while for model A (upper band) we 
notice that Ac is a rather non-Gaussian direction, which 
is not surprising since this model only gives upper bounds 
on the amplitude Ac- However, inspection of the figure 
does seem to support the view that it is reasonably ac- 
curate to use the Laplace approximation to compute the 
model evidence. Another qualitative criterion is the sim- 
ilarity of the marginalized ID posterior and the mean 
posterior. If the posterior is a multi-dimensional Gaus- 
sian, then the two curves coincide (see e.g. ReL 64]). 
From Fig. El we can indeed confirm that the two curves 
are very similar, again strengthening the conclusion that 
the Laplace approximation is legitimate in our case, more 
so for model B. 

We first compare the upper band of model B {Aii 
in the notation used in the Appendix) against the HZ 
inflationary case (A^2), and compute the Bayes factor 
Bi2 in favor of model B (upper band). The term £12 
(see Eq. HA13|) ) is just the difference of the best-fit log- 
likelihoods yielding £12 = —10.9, and clearly disfavors 
model B (upper band), whose fit is worse. The term C12 
(see Eq. (|A14ll ) describes the ratio of the occupied vol- 
umes in parameter space by the posterior pdf's of the 
two models times a factor taking into account the dif- 
ferent dimensionality of the two parameter spaces and is 
found to be C12 = 3.0. Further considerations are needed 
to evaluate ^12 (see Eq. HA15|l ). the term which reflects 
the prior available volume of parameter space under each 
model. In the present case, we do not need to specify the 
top-hat range of the priors on the parameters which are 
common to both model B (upper band) and the HZ in- 
flation model. This is because, whatever prior belief we 
hold about the possible range for these parameters, it will 
be the same for both models. The parameters common 
to both models are lub, ^^cdm, h, Zr and Ai. In the PQ 
model, the prior range for the CDM abundance actually 
applies to the axion component of CDM only, but the 
difference, which is caused by the presence in the uni- 
verse of the LSPs, is very small and insignificant for our 
considerations here. Also, we could actually assign to the 
inflaton amplitude Ai two different prior ranges for the 
PQ and the HZ inflation models if we had any reason to 
believe that the ranges can be significantly different in 
the two cases. Here we take the view that Ai is essen- 
tially a free parameter in both models, and thus whatever 
prior range we assign to it will cancel out from !Fi2. As a 
consequence, the only prior range which does not cancel 
out from J-"i2 is the one for the extra parameter of our 
PQ model, i.e. Ac- So we have 



where is the top-hat prior range of A^. for the PQ 
model which is allowed in model B (upper band). The 
bounds of the prior on A^ correspond to the limits on 
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TABLE V: Results of the Bayes factors analysis for comparing 
the PQ models A and B (upper bands) to the HZ inflation 
model. When considering CMBR data only, the odds are in 
favor of the HZ model against the PQ models. In particular, 
model B (upper band) is very strongly disfavored. 

Ac in the upper band of model B. The lower limit is of 
no importance, since we can simply set it equal to zero. 
The upper limit is achieved on the boundary of the upper 
band, which gives A^'^^ « 0.1. From these considerations 
and writing Ac in units of 10^"' (which are the same units 
used in the covariance matrix), we have A Ac ~ 10"*, and 
thus T\i — —9.2. Putting everything together, we find 
ln_Bi2 = —17.1 and thus the Bayes factor disfavors model 
B (upper band) against the HZ inflation model with odds 
of about 10^ against 1. Notice that the reason for such 
high odds comes partly from the worse quality of fit of 
model B (upper band), and partly from an "Occam's 
razor" penalty of the PQ model, because it introduces 
a new scale in the problem (the curvaton amplitude 
which has a very wide prior range. We comment further 
on this aspect below. 

We now compute the Bayes factor for the upper band 
of model A {M.\) against the HZ inflation model {M.-2). 
As mentioned above, in this case we expect the Laplace 
approximation to be less accurate, because A^ is now a 
non-Gaussian direction. The term £12 is now slightly in 
favor of model A (upper band), since its fit is marginally 
better than the one of the HZ inflation model, giving 
£12 = 0.5. From the covariance matrices of the two 
models, we obtain C12 = 4.8, while the same reason- 
ing as above gives again a very small value for T\i as 
a consequence of the large allowed prior range of Ac, i.e. 
T\2 = —9.2. This last term again heavily penalizes the 
PQ model, resulting in Ini3i2 = —3.9, or odds of 50 : 1 
against the PQ model A (upper band). The Laplace ap- 
proximation is however likely to underestimate the actual 
volume occupied by the likelihood function in parameter 
space, due to the fact that Ac is a rather non-Gaussian 
direction for model A. As a consequence, the above odds 
can be interpreted as an upper limit for the evidence 
against model A. We summarize these results on the PQ 
models A and B (upper bands) in Table [V] 

One must be very careful when interpreting the above 
results for the evidence. In fact, the HZ inflationary 
model, which we used for the comparison, is a natural 



benchmark model as far as the CMBR data are con- 
cerned. However, one must keep in mind that the PQ 
model which we describe in this work addresses many 
fundamental issues which lie outside the scope of the phe- 
nomenological HZ inflation model, such as the strong CP 
and ji problems of MSSM, the generation of the observed 
BAU and the nature of the CDM in the universe. Our ap- 
proach was to consider fundamental (SUSY GUT) mod- 
els of particle physics within the cosmological framework 
by merging together requirements and constraints both 
from the particle physics and the cosmology side. In gen- 
eral, it is clear that any viable particle physics model has 
many free parameters about which little or no experimen- 
tal information is available at the moment, such as the 
parameters of the electroweak Higgs sector, the sparticle 
mass spectrum or the boundary conditions from super- 
gravity which also determine the radiative electroweak 
symmetry breaking. Nevertheless, we do have some indi- 
cation about their order of magnitude by applying crite- 
ria of simplicity, naturalness and elegance to our funda- 
mental theories. At this stage, the exquisite cosmologi- 
cal data nowadays at our disposal can provide significant 
constraints on the parameter space of the model. Our ev- 
idence calculation, in fact, takes into account only a part 
of our knowledge (i.e. the CMBR data) neglecting all the 
other issues which are addressed at a fundamental level 
by our model. It seems fair to say that, on the whole (i.e. 
considering both cosmology and particle physics), the PQ 
model presented in this work aims at a broader explana- 
tory power than a phenomenological infiation model. We 
thus conclude from our analysis that the present CMBR 
data can strongly disfavor certain regions of parameter 
space, as it is the case for model B. In this case, we 
found robust evidence that a mixture of curvaton and 
inflaton contributions to the cosmological perturbations 
is in disagreement with the present CMBR data. On 
the other hand, the odds against model A should be re- 
garded while keeping in mind the above considerations. 
In conclusion, it seems to us that, at the present stage, 
we cannot reject the possibility of a subdominant cur- 
vaton contribution to predominantly inflaton-dominated 
adiabatic perturbations. 

On a more phenomenological level, our treatment of 
the evidence highlights a generic feature of any model 
which introduces a second (or several) scale-free param- 
eter(s) to describe an extra non-adiabatic component, 
namely that Occam's razor of Bayesian model compari- 
son will always penalize such a model with respect to the 
pure inflaton single-field HZ infiationary model by assign- 
ing to it a very small J-12 term. This reflects the fact that 
the extra amplitude parameter (describing an isocurva- 
turc contribution) can a priori assume any value at all, 
and therefore there is no hard-wired justification why its 
value should be smaller than 10~^, or indeed close to but 
different from zero. So, from a Bayesian point of view, it 
is certainly unjustified to increase the model's complex- 
ity to achieve only a minuscule gain (if any at all) on 
the quality of fit. Traditionally, most attention has been 
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devoted to the maximum likelihood value as the crite- 
rion to judge whether a certain new parameter is useful 
or not. But, from the point of view of model building 
and Bayesian model testing, restricting the prior volume 
of parameter space could end up to be as useful, for an 
inflationary model which would predict the value of Ai 
to be in the ball-park of 10~^ would have very favorable 
posterior odds against a model in which is essentially 
a free parameter. 

Finally, the example of model B (upper band) above 
strikingly illustrates that Bayesian "credible intervals" 
cannot be interpreted as "significance levels", for, just 
by looking at the constraints on the amplitude Ac for 
the upper band of model B, one could have (erroneously) 
deduced a 6(t detection of a non-zero curvaton ampli- 
tude. Clearly, from the poor quality of the best fit 
(— Ini* = 732.6), it would have been immediately obvi- 
ous, without the need of computing the Bayes factor, that 
this particular model could not be favored by data. How- 
ever, the conceptual point remains: the question whether 
a certain parameter value differs from the null reference 
value cannot be answered by looking at the confidence in- 
tervals (c.i.), but must be tackled by proper model com- 
parison procedures. While this point is certainly clear to 
data analysts working in the field, it seems appropriate 
to stress it once more in view of a widespread misinter- 
pretation of this concept. 



VIII. CONCLUSIONS 

We considered a simple and concrete SUSY GUT 
model which automatically and naturally solves the 
strong CP and /i problems of MSSM via a PQ and a 
continuous R symmetry. This model also leads to the 
standard SUSY realization of hybrid inflation. The PQ 
field of this model, which corresponds to a flat direction 
in field space lifted by non-renormalizable interactions, 
can act as curvaton contributing together with the infla- 
ton to the total curvature perturbation in the universe. 
In contrast to the standard curvaton hypothesis, we did 
not suppress the contribution from the inflaton. 

The CDM in the universe predicted by this model con- 
sists predominantly of axions which are produced at the 
QCD phase transition. It also contains LSPs originat- 
ing from the gravitinos which were thermally produced 
at reheating and decayed well after BBN. For simplicity, 
we assumed that there are no thermally produced LSPs 
in the model. The baryons, which are generated via a 
primordial leptogenesis occurring at reheating, as well 
as the LSPs inherit the partial curvature; perturbation 
of the inflaton. This is different from the total curvature 
perturbation, which receives a contribution from the cur- 
vaton too. Therefore, the baryons and LSPs carry also 
an isocurvature perturbation, whose correlation with the 
total curvature perturbation is mixed. The axions, as 
usual, contribute only to the isocurvature perturbation. 
The resulting total isocurvature perturbation has mixed 



correlation with the adiabatic one. 

Most of the parameters of the model but two were 
chosen by using criteria of naturalness and simplicity. 
We considered two representative cases for the two re- 
maining parameters, k and A, and compared the pre- 
dictions with the first-year WMAP observations and 
other CMBR measurements. We found that, in one of 
the two cases (model B), the curvaton and axions con- 
tributions to the CMBR power spectra arc important, 
and that this leads to a significant disagreement with 
the data. In fact, Bayesian model comparison disfa- 
vors this case as compared to the scale-invariant pure 
inflaton model with odds of 10'' against 1. The other 
possibility which we studied (model A) predicts a pre- 
dominantly adiabatic power spectrum from the inflaton, 
where the curvaton and axions contributions are sub- 
dominant. We derived upper bounds for the amplitude 
of the partial curvature perturbation from the curvaton 
in this case (Ac<43.2 x 10~^ at 95% c.l.), and noticed 
that the interplay of the non-inflaton contributions re- 
sults in a later reionization redshift {zr = 13.3lg;3 
68% c.l.). Even though the best-fit likelihood for this 
case (— InL* = 721.2) is better than in the pure inflaton 
HZ case, evaluation of the evidence under the Laplace ap- 
proximation gives odds of about 50 to 1 against model A 
compared to the pure inflaton HZ case. These odds must 
be interpreted with caution, since they do not take into 
account the fact that the PQ model aims at a more fun- 
damental explanatory power and that it addresses many 
particle physics issues which are outside the scope of in- 
flationary models. 

In summary, we have shown that certain regions of the 
parameter space (k w 3 x 10"^ and 10"*^ A^3 x 10"*) 
can be excluded on the grounds of present-day CMBR 
observations. Quantitative bounds on the allowed val- 
ues of K and A could be derived by treating them as free 
parameters in the MC analysis, an exploration left for 
future work. Our approach - embedding particle physics 
model building in the cosmological framework - pursued 
the issue of merging together fundamental theories and 
cosmological constraints in a realistic and viable model 
for the generation of the cosmological perturbations. It is 
encouraging that modern cosmological observations are 
now informative enough as to give useful and robust guid- 
ance along this path. 



ACKNOWLEDGMENTS 

We thank K. Dimopoulos, R. Durrer, S. Leach, D. 
Lyth, Ch. Ringeval, M. Sakellariadou and D. Wands for 
useful discussions. We are particularly grateful to K. Di- 
mopoulos and D. Lyth for communicating to us their re- 
sults on the randomization regime for the curvaton field 
prior to publication. This work was supported by the 
European Union under RTN contracts HPRN-CT-2000- 
00148 and HPRN-CT-2000-00152 and was performed on 
the Myri cluster owned and operated by the University 



24 



of Geneva. R.T. was partially supported by the Euro- 
pean Network CMBNet and the Swiss National Science 
Foundation. 



APPENDIX A: 



BAYESIAN INFERENCE: 
PRIMER 



1. Bayesian parameter estimation 

Bayesian inference is based on Bayes' theorem, which is 
nothing more than rewriting the definition of conditional 
probability: 



V{A\B) 



PiB\A)V{A) 
V{B) 



(Bayes' theorem). (Al) 



In order to clarify the meaning of this relation, let us 
write (a vector of d parameters under a model Ai) for 
A and D (the data at hand) for B, obtaining 



V{e\D,M) 



L{-D\e,M)Tr{e,M) 

j^^L{T>\e,M)7T{e,M)de 
LCD\e,M)TTie,M) 

ViT>\M) 



(A2) 



where ft designates the parameter space (of dimension- 
ality d) under model A4. This equation relates the pos- 
terior probability V{6\T>-, M) for the parameters 6 of the 
model M given the data D to the likelihood function 
L(D|0, Al) if the prior probability distribution function 
tt{9,M) for the parameters under the model is known. 
The latter is called "prior" for short. The quantity in 
the denominator is independent of 6 and is called the 
evidence of the data for the model Ai [H^ . The evidence 
is the central quantity for model comparison, as we ex- 
plain below, but, in the context of parameter estimation 
within a model, it is just an overall multiplicative con- 
stant which docs not matter. In short. 



posterior = 



likelihood x prior 
evidence 



(A3) 



The prior distribution contains all the knowledge about 
the parameters before observing the data, i.e. our phys- 
ical understanding of the model, our insight into the ex- 
perimental setup and its performance, and in short all our 
prior scientific experience. This information is then up- 
dated via Bayes' theorem to the posterior distribution by 
multiplying the prior with the likelihood function which 
contains the information coming from the data. The pos- 
terior probability is the base for inference about 6. The 
most probable value for the parameters is the one for 
which the posterior probability is largest. 

Bayes' postulate states that, in the absence of other 
arguments, the prior probability should be assumed to 
be equal for all values of the parameters over a certain 



1 

7rie,M) = [Hie - 0„,in)i?(emax - e)\ n (A4) 

where H is the Heaviside step function and A0j = 
dmaxj — Snun,j > 0, V j. Clearly, a flat prior on 6 does 
not correspond to a flat prior on some other set a{6) 
obtained via a non-linear transformation, since the two 
prior distributions are related via 



7r(0,M) =7r(a,M) 



da{e) 
d9 ■ 



(A5) 



A recurrent criticism is that the final inference depends 
on the prior which one chooses to use. However, in a 
situation in which the data exhibit a clear preference 
for a certain value for a parameter, the posterior is ef- 
fectively dominated by the likelihood, and the choice of 
prior will not matter much. This is currently the case, as 
far as the CMBR is concerned, for high "signal to noise" 
parameters such as the baryon density. Constraints on 
other parameters such as the curvaton amplitude Ac in 
model A (upper band) of our PQ scenario are likely to 
depend slightly on the details of the chosen prior distri- 
bution. In other words, constraints on parameters which 
are not clearly determined will suffer from a certain de- 
gree of subjectivity, depending on what prior ^^{0,^A) 
we choose on the right hand side of Eq. ljA2p . This fact 
should be interpreted as a warning, telling us that the 
data are not powerful enough to clearly single out the 
parameter under consideration. 



2. Bayes factors 

Let us consider two competing models Aii and AI2 
and ask what is the posterior probability of each model 
given the data D. By Bayes' theorem we have 



7'(X,|D)cx7'(D|X,)^(M) (^-l,2) 



(A6) 



where 7'(D|A^i) is the evidence of the data under model 
Aii and 7r(Ali) is the prior probability of the ith model 
before we see the data. The ratio of the posterior odds 
for the two competing models is called Bayes factor |75| : 



(A7) 



range (0„ 



< e < e„ 



This is called a "flat prior" 



Usually, we do not hold any prior beliefs about the two 
models and therefore tt{A4i) = 7r(Al2) — 1/2, so that the 
Bayes factor reduces to the ratio of the evidences. The 
Bayes factor can be interpreted as an automatic Occam's 
razor, which disfavors complex models involving many 
parameters (see Ref. '^6^ for details) as we discuss below 
and demonstrate in the text. 
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The evidence in favor of M can be evaluated by per- 
forming the integral 



ViT>\M) 



L{T>\e,M)TT{e,M)de 
v{e\B,M)de, 



(A8) 



where V{6\'D,A4) designates the non-normalized poste- 
rior probability (i.e. the numerator in the right hand side 
of Eq. ljA2|l ). Computing the above integral from the MC 
samples is difhcult, since there will be very few or no sam- 
ples in the tails of the likelihood. There are however a 
number of approximate methods which can be applied 
[t^. Most of them rely on the fact that, for a large 
number of data points, the likelihood function will tend 
to be a multi-dimensional Gaussian distribution. One 
simple approximation is then to expand the logarithm of 
the non-normalized posterior to second order around its 
peak, which (for flat prior) occurs at the best-fit param- 
eter choice 0,, where the likelihood is maximized. We 
obtain 



In 



'P{e\B,M) 

vie,\-D,M) 



-lio-0. 



^C-i(6»-6>,), (A9) 



where C is the covariance matrix of the model M eval- 
uated at the best-fit point, which can be estimated from 
the MC samples. This is called Laplace approximation 
and can be expected to give sensible results if the non- 
normalized posterior is reasonably well described by the 
multi-dimensional Gaussian Eq. I)A9|I . It is then straight- 
forward to evaluate the evidence by using the approxi- 
mate form in Eq. ljA9|l for the non-normalized posterior, 
obtaining 

P(D|M) = / ri9\D,M)de 
Jn 

« (27r)^P(6>,|D,A^)VdetC. (AlO) 

For flat separable priors of the form in Eq. (|A4() we can 
simply write, abbreviating A6 = YYj^i ^^j^ 



vie,\-D,M) = L(p\e„M) 



A6»' 



(All) 



an expression which is still approximately correct even 
if we used non-flat priors, and interpret A9j as the typ- 
ical width of the prior pdf (say the standard deviation 
along the direction of the jth parameter for a Gaussian 
distributed prior). For a Gaussian prior, it is easy to de- 
rive an exact expression analogous to Eq. I|A11|) . but for 
simplicity we will stick to the above form. 

For the logarithm of the Bayes factor in the Laplace 
approximation, we finally obtain the following handy ex- 
pression: 



where we have defined 

_ , i(D|0«,A^i) 



£12 = In 



C12 ^ i ( In 



^12 



In- 



AO 



(2^) 

(2) 



(A13) 

, detC«\ 
In - — ) , (A14) 



det C(2) 



AO 



(1)' 



(A15) 



lnBi2 w £12 +C12 +Tr. 



(A12) 



where quantities referring to the model A4i {i = 1, 2) 
are indicated by a superscript (i). The term £12 is the 
logarithm of the ratio of the best- fit likelihoods. From a 
frequentist point of view, this quantity is approximately 
distributed, and thus it is common practice to apply 
to it a goodness-of-fit test to assess whether the extra 
parameters have produced a "significant" increase of the 
quality of fit. If the model M.i contains more parameters 
than the model M.2, then M.i should show an improved 
fit to the data, i.e. we should have £12 > 0, unless the 
extra parameters are useless, in which case £12 = 0. In 
any case, a goodness-of-fit test alone does not say any- 
thing about the structure of the parameter space for the 
model under consideration, since it is limited to the max- 
imum likelihood point. But Bayesian evidence does con- 
tain two further pieces of information, C12 and T12, which 
taken together are sometimes referred to as "Occam's 
factor". Here we prefer to consider these terms sepa- 
rately to help distinguishing their different origin. The 
term C12 describes the structure of the posterior shape 
in the Gaussian approximation. Since the determinant is 
the product of the eigenvalues of the covariance matrix, 
which represent the standard deviations squared along 
the corresponding eigenvectors in the parameter space of 
the model, it follows that if AI1 has a narrower posterior 
than M2, then C12 < 0, thereby disfavoring Aii. This 
apparent contradiction (how can a model with smaller 
errors display a smaller evidence?) is resolved when we 
take into account the term J- 12 , which describes the prior 
available parameter space under each model. The sum 
of the terms C12 and !Fi2 thus disfavors the model with 
the largest volume of "wasted" parameter space when the 
data arrive. A more complex model M.i - having a large 
number of parameters and thus a large volume of prior 
accessible parameter space - will naturally fit the data 
better due to its flexibility, i.e. we will have £12 > 0, but 
it will be penalized for introducing extra dimensions in 
parameter space, i.e. the sum C12 + J^i2 will be negative. 
In summary, the Bayes factor tends to select the model 
which exhibits an optimal trade-off between simplicity 
and quality of fit. 

It should be clear that the choice of priors plays an 
important role in Bayesian model comparison (testing) 
via its impact on the term Ti2- In particular, prior pdf's 
used in evaluating the Bayes factor must be proper, i.e. 
normalizable, so that we can impose the normalization 
condition 

/ 7r(6»,7W)d0= 1. (A16) 
Jn 
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Although generally a strong dependence on the choice of model features into the Bayes factor, as we show in the 
priors is regarded as suspicious, in this case, we should text of the paper, 
consider the role of J-12 as a way to implement a priori 
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